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EFFECTS OF INCREMENTS OF REINFORCEMENT 
IN HUMAN PROBABILITY LEARNING! 


MAYNARD W. SHELLY II 


Office of Naval Research 


There are indications in the litera- 
ture that even though relative fre- 
quencies of reinforcement may have 
proportionate effects on response fre- 
quencies asympototically, they may 
not have such proportionate effects 
at preasymptotic levels (e.g., Grant, 
Hake, & Hornseth, 1951). Such 
indications can be observed when 
trial or block number replaces rein- 
forcement conditions as the parameter 
of the family of performance curves, 
and reinforcement conditions replace 
trial or block number on the abscissa 
in the usual graphs of results. From 
similar plots there are also faint 
evidences (e.g., Neimark, 1956) that 
any lack of preasymptotic propor- 


1 This research was conducted while the 
author was at the University of Wisconsin. 
He wishes to express his appreciation to the 
Graduate School of the University of Wis- 
consin for providing funds for the experi- 
mental apparatus employed, and to the Social 
Science Research Council for providing the 
author with a Research Training Fellowship 
for the period during which the experiment 
was completed. He also wishes to thank the 
Office of Naval Research for providing the 
author with time, and the Department of 
Psychology, Northwestern University for 
the facilities to complete the final drafts of 
this paper. 


tionality may be affected by the type 
of reinforcement, i.e., whether it is 
contingent or noncontingent. This 
experiment was conducted to deter- 
mine how the relative frequency of a 
response with some fixed probability 
of reinforcement is affected by varia- 
tions in the probability of reinforce- 
ment of alternative responses. 

To study the effects of variations 
in the probability of making alterna- 
tive responses, this investigation em- 
ployed a situation approximating the 
usual probability learning conditions. 
Three alternatives were used, and 
Ss were asked to predict which of the 
three alternatives would be correct 
on a given trial. None, one, two, 
or three alternatives could be correct 
on any one trial. Because of the 
indications that contingent and non- 
contingent reinforcement might pro- 
duce different effects on another alter- 
native, even for a constant probability 
of reinforcement, both conditions of 
reinforcement were used. 

This study uses three, rather than 
two, response alternatives for two 
principal reasons. The first is that 
with three alternatives it is possible 
to have more sizable variations in the 
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sum of the probabilities of reinforce- 
ment on two of the three alternatives 
and still have the probability of 


reinforcement on the third a maxi- 
mum. Secondly, it is only when three 
or more alternatives are used that 
contingent and noncontingent rein- 
forcement are fully differentiated. 


METHOD 


Subjects.—The Ss were 170 students (ap- 
proximately evenly divided between women 
and men) from elementary psychology courses 
at the University of Wisconsin. 

A pparatus.—The apparatus consisted of a 
booth with a flat black interior in which S sat, 
and equipment outside the booth which 
regulated the experimental conditions. In 
the booth an aluminum chassis box (8 in. & 17 
in.) was mounted on a sloping table. It 
contained 16 four-pole snap switches, above 
each of which was mounted a 6-v. green- 
domed lamp. For the conditions of this 
study, only the three switches and their 
associated lamps were uncovered. A yellow- 
domed 6-v. lamp was also mounted in front 
of S. This informed him that a new trial 
had begun. When it went off, S was to make 
a choice of one of the three switches before 
him. 

The equipment outside of the booth con- 
sisted largely of a timing motor, Western 
Union tape transmitters, and lamps on a 
panel which permitted E to record S’s re- 
sponses. There were also a number of rotary 
switches which permitted E to change the 
correspondence between channels of the 
Western Union tape transmitters and S’s 
response alternatives. 

Design.—The experiment consisted of 17 
conditions, 10 different Ss serving in each 
condition. In all 17 conditions at least one 
alternative would lead to a correct prediction 
on 75% of the trials if always chosen. This 
alternative was combined with two other 
response alternatives. The reinforcement 
probabilities on these other two alternatives 
were combinations of 0, .25, .50, and .75. 
Random binary sequences were established 
independently for each alternative, so that 
even when the probabilities summed to one, 
there were trials on which was 
correct and trials on which more 
switches would be correct. 

The two types of reinforcement were 
contingent and noncontingent reinforcement. 
When S served under contingent reinforce- 


no switch 
one or 
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TABLE 1 


COMBINATIONS OF REINFORCEMENT PERCENT- 
AGES USED IN THE EXPERIMENTAL 
CONDITIONS 


~—Ins 
wuuMmwnu 


Note.—The numbers in the table represent the 
percentages of times the first, second, and third alter- 
natives, respectively, would be reinforced if constantly 
chosen. 

* This condition was not completed under noncon- 
tingent reinforcement because Ss were responding 
perfectly after only a few trials, and a factor of boredom 
would have been too strong a confounding influence. 


ment, he was only informed as to whether or 
not he had made a correct prediction. If he 
had been correct, the lamp above the 
alternative he had chosen came on. If he had 
been incorrect in his prediction, no lamp came 
on. When noncontingent reinforcement was 
used, a lamp came on above each correct 
alternative (if there were any) irrespective of 
whether S had made a correct or incorrect 
prediction. 

The design is presented in Table 1. Here 
the first “75” refers to the presence of an 
alternative which was reinforced 75% of the 
time. The two numbers following the initial 
“75” give the percentages of reinforcements 
which would be received were either ot the 
other alternatives constantly chosen. The 
effects of any consistent position preferences 
among Ss were eliminated by permuting the 
positions of the various alternatives among Ss. 

Procedure-—The Ss in this experiment 
served individually. When they reported to 
the laboratory, they were assigned to one of 
the 17 conditions on a random basis with the 
restriction that no more than 10 Ss be assigned 
to any one condition. Upon being seated in 
the experimental booth, all Ss were given 
the same standard set of instructions (except 
for modifications appropriate to the conditions 
of reinforcement). These instructions told 
Ss that the experiment was designed to study 
their abilities to predict future events, and 
that the object of the experiment was to make 
as many correct predictions as possible. Also 
included were instructions regarding the 
mechanics of the experiment. It was strongly 
suggested that they were to make one choice 
on each trial, and never more than one. A 
brief demonstration was then given. 

Each S then made a total of 320 predic- 
tions. Each trial was of 4-sec. duration. 
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RESULTS 


In Fig. 1 are presented the per- 
formance curves for the 75% alterna- 
tive for the conditions of the present 
experiment. At the termination of 
each curve are two numbers separated 
by a hyphen. These numbers repre- 
sent the percentages of times that the 
two alternatives not plotted for that 
group could lead to reinforcement in 
100 trials. The number in parentheses 
associated with each curve refers to 
the asymptote which would be ap- 
proached were S responding propor- 
tionately (in the following sense) to 
the percentage of reinforcement. A 
will be 
proportionate if 


considered to be 
the proportion of 


response 


times an alternative is chosen is equal 
to the ratio of the number of times 
that reinforced 
to the sum of the number of times 
all alternatives reinforced. 
For the 5 condition the ratio 
for the % alternative would be 
75/(75 + 25 25). As can be seen, 
the departures from such _ propor- 
tionality are greatest when the de- 
nominator of the ratio is 
smallest. 

In the introduction it was stated 
that there are suggestions that incre- 
ments in amount of reinforcement 
assigned to other alternatives do not 
always have proportionate decremen- 
tal effects on the response alternative 
being analyzed. Such a suggestion 
seems to be further confirmed by the 
performance curves in Fig. 1. The Ds 
in Fig. 1 are the distances between 
curves when the totals of the two 
numbers identifying the curves in- 
volved differ by 25. (To keep the 
graphs relatively simple only two 
of the possible Ds are shown.) To 
consider an example, take Part a 
of Fig. 1; here 25+ 0 and 50+ 0 
differ by 25 and an arrow is drawn 


alternative can be 


can be 
5-25-2 


75 
759 


above 
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between the two curves (for the last 
block of trials only) and labeled by 
D. It can be observed that the D 
distances are not always proportionate 
to the differences between the num- 
bers included in the parentheses. In 
an attempt to analyze such relation- 
ships more formally, the following 
type of decision procedure was used. 

We can specify the percentages of 
reinforcement for two alternatives and 
let the third vary over the values 0, 
25, 50, and 75, which stand for per- 
centages of reinforcement. Denote 
the third alternative, which can vary, 
by X. Then, using { } to denote a 
set, {75-0-X} is, as an example, the 
set of conditions {75-0-0, 75-0-25, 
75-0-50, 75-0-75}. Such sets of condi- 
tions will be referred to as C sets. 
It will be observed that when the 
members of a C set are written in 
the above ordering, the percentage of 
reinforcement for the last alternative 
increases by steps of 25; i.e., the X 
values increase by steps of 25. By 
using the D distances as ordinate 
values, and X values on the abscissa 
for a fixed C set of conditions we get 
a family of curves, the parameter 
of which is trial block number. For 
each of the two types of reinforcement 
there are four C sets: {75-0-X}, 
{75-25-S}, {75-50-X}, and {75-75-X}. 
Clearly, the conditions of the experi- 
ment usually do not belong to only 
one C set. 

Because X assume four val- 
ues, there are three D distances pos- 
sible, and hence each curve, for a 
given block of trials, based upon a 
given C set, contains three points, 
the first of which is at an X value of 
25. Each of these three points can 
be interpreted as the decrease in 
number of times the first alternative 
(the alternative which in every condi- 
tion could lead to a reinforcement 
75% of the time) is responded to as a 


can 
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Fic. 1. Proportion of responses to the 75% alternative in all conditions for Trial Blocks 
2, 4, 6, and 8. The division of performance curves between Parts 2 and b, and between Parts 
c and d is simply to make them more easily readable. 
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result of an increase in the X value 
by 25, e.g., from 0 to 25, or from 
25 to 50. It can easily be computed, 
using the definition of proportionality 
in this paper, that the maximum 
point on each of these curves should 
be the first point, i.e., the D dis- 
tances between 75-0-0 and 75-0-25 
for example. 

Families of curves based upon the 
C sets were plotted for the second, 
fourth, sixth, and eighth block of 40 
trials. 
hypotheses were then tested. The 
first was that twice the maximum 
point on each curve was greater than 
the sum of the other two. This simply) 
tested the statistical hypothesis that 


Sequences ot one or two 


there was a statistically significant 
maximum point. The _ procedures 
used in this and the following hypoth- 
esis are outlined in Grant 
(1956). For example, if the second 
point were the greatest, the hypothesis 
tested would be: 


those 


0 [1] 


where yo, for example, is the mean 
for that condition of the C set for 
which X = 0. 

If this hypothesis were not rejected, 
then the hypothesis that the sum of 
the two greatest points was greater 
than twice the remaining point would 
be tested. For example, if this 
happened to be the first two versus 
the third point, the hypothesis tested 
would be: 


\Mo — Mos) 1 (M25 — M50) 


) 
— ~\M50 


have the same 


If either one 


where Mo, Mas, etc. 
meanings as in [1 ]. 


of the hypotheses were rejected, then 


would’ be 
two points. 


maximum 
one, or at 
inferred 


a statistical 
reached at 
These 


statistical maxima, 


rABLE 2 


VALUES Or X FOR Wuicu MAx(D) Occurs 


Trial Block Number 


Noncontingent Conditions 


50 50 
50 50 


sss 
wuuwuwv 


sss 


mw 


* For 75-0-0 the value of 1.00 was used for the propor 
tion of responses to the 75% alternative and the sta 
tistical procedures adjusted accordingly 

» For 75-75-75 the value of .333 was used for the 
proportion of responses to the 75% alternative and the 
statistical procedures adjusted accordingly for use of a 
nonempirical value. 


max(D), are given in Table 2. It is 
very that max(D), when it 
occurs, does not always occur at an 
X value of 25. Indeed, 62.5% of 
them occur at an X value of 50. 


evident 


DISCUSSION 


Obviously the more complete implica- 
tions of this study can only be arrived 
at through an extended mathematical 
analysis. There are, however, several 
things about the findings which can be 
emphasized. First, the max(D) values 
do not usually occur at an X value of 25. 
This implies that the probabilities of 
reinforcement are not being responded 
to proportionately. This result 
with the findings of Gardner 
and Cotton and Rechtschaffen (1958). 
Secondly, there is considerable con- 
sistency in the X values at which the 
max(D)s occur for different blocks of 
trials. This implies some consistency 
Thirdly, 
62.5% of the max(D)s occurred at an 
X value of 50. That is, the greatest 
decrease in responding to the 75% 
alternative the 


agrees 
(1957) 


in the relative learning rates. 


occurred when proba- 
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bility of reinforcement on the variable 
alternative in a C set reached maximal 
uncertainty, i.e., changed from a prob- 
ability of .25 to a probability of .5. 


It is possibly this last finding which is 
most suggestive for future research. 


SUMMARY 


An experiment was conducted to deter- 
mine how the relative frequency of a response 
with some fixed probability of reinforcement 
is affected by variations in the probability 
of reinforcement of alternative 
The experimental situation approximated 
the usual probability learning conditions. 
The Ss were asked to predict which of three 
events would occur on any one trial. One 
of the alternative responses had a probability 
of reinforcement of .75 for all Ss. The second 
and third responses had probabilities of 
reinforcement varying from 0 to .75 in steps 
of .25 for the various experimental groups. 
The results were found not to be in agreement 
with a prediction based upon a generalization 
ot the meaning of proportionality for two 
alternative responses. Instead it was found 
that, when the reinforcement value of the 


responses. 
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first two alternative responses was kept 
constant, the greatest decrement in respond- 
ing to the first (the .75 alternative) occurred 
when the probability of reinforcement on the 
third alternative increased from .25 to .5. 
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IDENTIFICATION OF VISUAL CORRELATIONAL 
SCATTERPLOTS ! 


IRWIN 


Operational A pplications Laboratory 


A useful tool for the examination 
of interaural phenomena is the de- 
scription of auditory signals at the 
two ears in terms of their correlation 
with each other (Pollack & Trittipoe, 
1959; Zerlin, 1959). The aim of the 
present paper is to describe initial 
studies with a visual correlational dis- 
play. This display, we believe, pro- 
vides a useful tool for the examination 
of dynamic 
the behavioral 
hypotheses. 


and 
testing of statistical] 


shape perception 


The student of psychological sta- 
familiar with the correla- 
tional scatterplot. If two variables, 
A and B, strongly 
common factor, C, 
scatterplot of A 

confined to a 


tistics is 


share a single 
the correlational 
versus B will be 
small about the 
diagonal. If the two variables weakly 
share the common factor, C, the 
correlational scatterplot may be dif- 
fuse and may 

of the entire A 
terms of Variance, 
the (squared) is equal 
to the ratio of the variance common 
to each of two variables to the total 


area 


cover a large extent 
versus B 
components of 


area. In 


correlation 


variance (common variance plus in- 
dependent 


variance associated with 


each of the two variables). 


Licklider and Dzendolet (1948, 
! This report is AFCRC TR 59-14 of the 
Air Force Cambridge Research Center. This 
research supports Project 7682, Information 
Processing by the Human Operator, of the 
\ir Research and Development Command 
Program in Human Performance. The 
writer wishes to acknowledge the able assist- 
ance of William Trittipoe in all phases of the 
testing program, and John Schjelderup with 
respect to the experimental equipment. 


POLLACK 


, Air Force Cambridge Research Center 


Fig. 3, p. 123) have demonstrated a 
simple procedure for producing cor- 
relational distributions. Three inde- 
pendent random noise generators, A, 
B, and C, are employed. If the out- 
put of Noise Generator A is sent to the 
x coordinate of an oscilloscope and 
the output of Independent Noise 
Generator B is sent to the y coordi- 
nate, a diffuse correlational plot with 
a circular envelope will be obtained. 
Since A and B are independent ran- 
dom distributions, a 
zero will be obtained. If, on the 
other hand, the output of Noise 
Generator C alone is sent to both 
coordinates of the oscilloscope, a 
thin line at 45° will be traced if the 
x and y coordinate gains are equal. 
Since a single distribution is sent to 
both coordinates, a correlation of 1.0 
will be obtained. 

If Cis mixed with A and sent to the 
x coordinate; and if C is mixed with 
B and sent to the y coordinate, inter- 
mediate correlations 
obtained. The greater the relative 
contribution of the common source, 
C, the higher will be the correlation. 
The greater the relative contribution 
of the independent sources, the smaller 
will be the correlation. And, in 
general, the correlation (squared) will 
be equal to the ratio of the common 
noise power, C, over the total noise 
power (C+ A) or (C+ 8B). Lick- 
lider (1948, Fig. 4, p. 155) presents 
oscillographic pictures traced out by 
a rapidly moving electron stream 
for a wide range of correlations. 
Licklider and Dzendolet (1948, Fig. 


2, p. 122) also present oscillographic 


correlation of 


can also be 
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pictures obtained by discrete sampling 
of correlational distributions.’ 


METHOD 


The task of S was to identify one of two 
correlational distributions presented on an 
oscilloscope. In terms of three independent 
noise generators, a reference correlation was 
obtained by routing (C + A) to one coordi- 
nate of the oscilloscope and by routing 
(C+ B) to the other coordinate. In all tests, 
A =B;and(C+A)=(C+B8B). Asecond, 
higher, correlation was obtained by mixing a 
higher level of the common source, C’, with 
the two independent noise sources. The task 
of S was to identify whether the correlation 
was produced by (C + A) and (C + B) or by 
(C’ +A) and (C’+ 8B). In terms of the 
verbalization of S’s task, he was instructed to 
press his voting lever “down” for the ‘‘stub 
bier’ of the two scatterplots and to press his 
voting lever “up” for the “thinner” of the 
two scatterplots. It is important to note that 
the power sum of (C’ + A) was equated to 
(C +A) in order that distributional differ- 
ences, rather than differences in amplitude, 
would serve as the relevant cue for discrimi- 
nation. 

Factors which facilitated rapid learning 
of the task were: (a) immediate confirmation 
of correctness of response was provided; (0) 
a consistent direction of movement of S’s 
voting lever was assigned to the “thinner”’ 
of the two correlations; (c) Ss worked in a 
group and the score of each S was announced 
to the entire group after each set of 30 or 60 
presentations. 

The oscilloscopes employed were Dumont 
Model 304-A, one with a fast blue P-11 
phosphor, one with a medium persistence 
green P-1 phosphor. Since no consistent 
difference was observed between the results 
yielded by the two tubes, only the average 
results are reported. Approximately three- 
quarters of the 5-in. diameter was employed 
for the display. The noise generators were 
Grason-Stadler Model 455 B, set at 20 kcps. 
The noise spectrum was uniform between 100 
and 6800 cps after being passed through a 


2Even the excellent still photographs in 
the two references fail to convey the dazzling 
dancing parade of correlation patterns avail- 
able for study. To the serious reader inter- 
ested in dynamic displays with information 
of well defined statistical properties, a lash-up 
of the noise generators and oscilloscope is 
recommended to taste the full richness and 
flavor of the available materials. 
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band-pass filter with an attenuation charac- 
teristic of 18 db/octave. Low-pass filtering 
was obtained by sharp mz-derived filters, 
Gertsch Model BP-1. A variable phase-shift 
circuit was employed in order to sharpen 
the scope image for high noise frequencies. 
The interval between successive correlation 
displays was 4 sec. 

Discrete samplings from the correlational 
distributions were obtained in the following 
manner. The two correlational distributions 
were first established, and the intensity 
control of the oscilloscope was reduced so 
that a picture was not visible. The intensity 
axis, or 2 axis, of the oscilloscope was then 
pulsed to yield short discrete samples from 
the correlational distribution. The pulse 
duration was 6 usec., obtained from a Measure- 
ments Corporation 79-B pulse generator. 

In all, eight separate experimental series 
(to be described separately) were carried out. 
In each series, a reference correlation (C + A) 
was established, and a wide range of variable 
correlations (C’ + A) was explored in order 
that the point of 75°% correct two-choice 
identification established. In all 
tests except Series IV, the average seating 
distance from the scopes was about 18 in. 
In Series IV, the distance was 54 in. A 
nonpulsed display was employed in Series 
I-V; a sampled discrete-pulsed display was 
employed in Series VI-VIII. 

Series I consisted of four Ss, all laboratory 
personnel experienced in psychophysical 
testing. Series II-V employed six Ss, five 
female college undergraduates and one of the 
Es. Prior to the testing series, the under- 
graduates had not participated in psycho- 
physical studies. Series VI-VIII employed 
six Ss, four female college undergraduates 
and both Es, all experienced in psychophysical 
studies. Each presented datum point for 
for Series II-VIII represents the interpolation 
from a psychophysical function based on 
about 1800 observations—6 Ss, 4-6 variable 
correlations, and 60 observations by each S 
for each variable correlation, except Series V 
the corresponding number of 
was about 5600. 


could be 


observations 


RESULTS 


Series I: Preliminary Tests 


The purpose of Series I was to com- 
pare the results of visual tests with 
the results of auditory tests that were 


noise 
& Trittipoe, 


with the 


(Pollack 


carried out 


distributions 


same 
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1959). In the auditory tests, (C + A) 
or (C’ +A) was presented to one 
earphone and (C + B) or (C’ + B) 
was presented to the other. The 
abscissa of Fig. 1 is the reference 
correlation (squared); the ordinate 
is the difference between the reference 
correlation (squared) and the vari- 
able correlation (squared) required 
for 60, 75, and 90% correct identifica- 
tion. The visual tests are represented 
by solid lines. The thin dashed curve 
represents 60% correct identification 
for the auditory tests. 

Two main points are suggested by 
Fig. 1: (a) In light of the vastly 
different tasks with the auditory 
and visual displays, the form of the 
correlation identification function for 
auditory and visual tests is strikingly 
similar. The 60% correct auditory 
contour nearly overlaps with the 
90% correct visual contour. (bd) 
The three visual correlation identifica- 
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Fic. 1. Comparison of auditory and visual 
correlational displays. The abscissa is the 
reference correlation (squared). The ordinate 
is the difference between the required correla- 
tion (squared) for 60, 75, and 90% cor- 
rect identification and reference correlation 
(squared). The ordinate is scaled logarithmi- 
cally. The dashed line represents the results 
of auditory tests for 60% correct identifica- 
tion. Tests of Series I. 
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tion functions exhibit roughly the 
same general form. Because the two- 
choice identification function is nearly 
linear about its midpoint, the 75% 
correct identification level was se- 
lected to summarize the results of all 
of the other tests. 


Series II: Effect of Duration 


The results of Series II are pre- 
sented in Table 1. A given testing 
condition is defined by the display 
duration (across the top) and the 
reference correlation (along the left 
side). each condition, three 
entries are presented: (a) the C’/A 
ratio in decibels required for 75% 
correct identification when paired 
with the reference correlation on the 
left; (0) 
(squared) ; 


For 


the equivalent correlation 
and (c) the equivalent 
Fisher z transformation score. The 
three corresponding entries are also 
presented for the reference conditions 
on the left. 
The effect of duration is not marked 


except for the conditions of high 


reference correlation where extremely 
large differences were needed to obtain 


75% correct identification. (The 75% 
correct identification level could not 
be obtained for the 10-msec. display 
for the two highest reference correla- 
tions within the ranges of conditions 
available.) 

The ratio between C and C’ (in 
terms of the decibel difference re- 
quired for 75% correct identification) 
furnishes one measure of relative 
sensitivity. In terms of this measure, 
maximum sensitivity is obtained near 
a reference correlation (squared) of 
about 0.5 with reduced sensitivity 
for lower and higher reference corre- 
lations. 

Another measure of sensitivity is 
the difference between the correla- 
tions (squared) required for 75% 
correct identification. The results 
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TABLE 1 
IDENTIFICATION OF Two VISUAL CORRELATIONS AS A FUNCTION OF THE REFERENCE 
CORRELATION AND DispLAyY DURATION 
(For each condition, three entries are presented: (a) the C’/A ratio in decibels 
which was correctly identified 75% of the trials from the reference C/A 
ratio listed on the left; (b) the equivalent correlation coefficient 
(squared) ; and, (c) the equivalent Fisher 
z-score transformation. ) 


Reference Correlation Measure 


C/A —« 
r? .00 
z .00 


C/A —20 db 
r? .O1 

z .10 

C/A —10 db 
r? .09 


z .31 


C/A —5 db 
r? .24 
z .54 


j 


A 0 db 
5 


C/ 
r, 
z .88 
C/A +5 db 
r> .76 
z 1.34 
C/A +10 db 
r? .91 
z 1.87 


C/A +15 db 
r2 .97 
z 2.43 


C/A +20 db 


C/A +25 db 


® .(3)8 .9998. 
b.(4)8 .99998. 


are presented in Fig. 2 in terms of the 
Ar? measure. The lower abscissa is 
the C/A ratio in decibels; the upper 
abscissa is the equivalent correlation 
(squared). The parameter on the 
curves is the display duration. In 


Duration in Milliseconds 


1000 


— 10.0 db 
.09 
31 


— 10.0 db 
.09 
31 

—7.0 db 
17 


16.7 db 
.979 
2.62 
21.2 db 
.992 
3.13 
31.6 db 
.(3)28 
4.3 


terms of this measure, sensitivity 
changes little until a reference correla- 
tion (squared) of about 0.5 with rapid 
improvement, thereafter, for higher 
reference correlations. 

Another measure of sensitivity is 
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the difference between the correla- 
tions in terms of a metric which 
normalizes the sampling variance of 
the correlation coefficient. If succes- 
sive samples are, for example, ob- 
tained from a population correlational 
distribution with a correlation of near 
1.0, the individual sample correlations 
will all be near 1.0. However, if 
successive samples are obtained from 
a population correlational distribution 
with a correlation of nearly zero, the 
sample correlations may vary over a 
wide range. That is, the sampling 
error is much wider for correlations 
near zero than for correlations near 
1.0. The Fisher z transformation 
attempts to equalize the sampling 
variance for all correlations. In 
terms of the Az measure, over most 
of the reference correlations, the range 
of scores is considerably reduced com- 
pared with the range of scores in terms 


REFERENCE CORRELATION, r,* 
| 24 5 % 91 .97 99 .997 
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Fic. 2. 
lations. 


Identification of two visual corre- 
The upper abscissa is the reference 
correlation (squared); the lower abscissa is 
the equivalent C/A ratio in decibels. The 
ordinate is the difference between the re- 
quired correlation (squared) for 75% correct 
identification and the reference correlation 
(squared). The ordinate is scaled logarithmi- 
cally. The parameter is the display duration 
in milliseconds. Tests of Series IT. 
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Fic. 3. Effect of low-pass cutoff frequency 
upon identification of two visual correlations. 
The abscissa is the low-pass cutoff frequency 
in cycles/second. The ordinate is the C’/A 
ratio in decibels required for 75% correct 
identification from a C/A ratio of 
The parameter is the display duration in 
milliseconds. Tests of Series III. 


— ©, 


of Ar? or Ar. We tentatively interpret 
this result to indicate that S’s task is 
to sample the correlational distribu- 
tions and to make a test between alter- 
native statistical hypotheses. That 
performance is also reduced with 
shorter durations also lends support 
to the notion that S is acting as 
a tester of alternative statistical 
hypotheses. 


Series III: Effect of Frequency Range 


The effect of low-pass filtering of 
the noise distributions was examined 
for a zero reference correlation for 
three durations: 10, 100, and 1000 
msec. In terms of the stimulus to 
S, the effect of low-pass filtering is to 
thin out the density of the correla- 
tional distribution upon the oscillo- 
scope. This operation may be alter- 
natively considered from a sampling 
viewpoint as a reduction of the sample 
size available for inspection. In 
terms of the testing of alternative 
statistical hypotheses, we would there- 


’The equivalence of the correlation 
(squared) with the ratio of the noise powers 
may be questioned for restricted bands of 
noise. 
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fore expect poorer discrimination with 
successively deeper cuts in the noise 
frequency distribution. 

The results of the filtering tests are 
presented in Fig. 3. The abscissa is 
the low-pass cutoff frequency. The 
ordinate is the C’/A ratio necessary to 
achieve 75% correct identification. 
The parameter is the duration of the 
correlational display. The reference 
correlation was zero. 

Larger C’/A ratios are 
as the frequency range is 
The effect is most marked 
display of shortest duration. 


required 
reduced. 
with the 


Series IV: Effect of Size 


If a given correlation is established, 
a change in size of one or both axes will 
leave the correlation invariant. If S’s 
task is primarily that of identifying 
differences in correlation, we would 
expect little change in identification 
with changes in size (until acuity 


considerations become important). 


RELATIVE SIZE 
32 10 
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Fic. 4. Effect of size of scatterplot upon 
identification of two correlations. The lower 
abscissa is the attenuation in decibels applied 
to both coordinates of the oscilloscope. The 
upper abscissa is the relative linear size of the 
resulting scatterplots. The ordinate is the 
C’/A ratio in decibels required for 75% 
correct identification from a reference C/A 
ratio of 0 db. The parameter is the display 
duration in milliseconds. Tests of Series IV. 
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The effect of the size of the scatter- 
plot is presented in Fig. 4. The 
ordinate is the C’/A ratio necessary 
for 75% correct identification from a 
C/A ratio of Odb. The lower abscissa 
is the attenuation, in decibels, im- 
posed upon both the x and y coordi- 
nates of the oscilloscope. The upper 
abscissa presents the relative size of 
the scatterplot. The parameter is the 
duration of the display. 

Under the conditions of testing, 5- 
to 10-fold linear reductions in size 
were easily tolerated, but substan- 
tially higher C’/A ratios were required 
for larger size reductions. 


Series V: Comparison of Positive and 
Negative Correlations 


A further comparison between audi- 
tory and visual tests is furnished by 
the identification of positive vs. 
negative correlations. In the auditory 
tests (Pollack & Trittipoe, 1959), 
it was observed that only limited use 
could be made of the direction of 
the interaural correlation in the 
vicinity of zero correlation. Spe- 
cifically, it was found that the abso- 
lute value of the correlation necessary 
to achieve a fixed identification level, 
lr;| was nearly identical under the 
following three testing procedures: 
Procedure A(+7; vs. 7, or zero correla- 
tion); Procedure B (—7; vs. 7.) and 
Procedure C(+7; vs. —7r;). 

With visual correlations, nearly 
complete contribution of the direction 
of the correlation might be expected. 
That is, identification of a positive 
tilt of the correlational scatterplot 
from a negative tilt should be sub- 
stantially better than identification 
of a positive tilt from a zero tilt. 
Series V tests this point. 

In all, four procedures were ex- 
amined in Series V. Procedures A, B, 
and C duplicated the corresponding 
test conditions for the auditory tests. 
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And, since Procedures A and B 
yielded equivalent results, only the 
average of their results will be pre- 
sented as Procedure A:B. In addi- 
tion, Procedure D(+r; vs. 7.) was 
examined to determine whether un- 
certainty of the direction of the 
correlation would retard performance. 
In Procedure D in a random order, r. 
was presented upon one-half of the 
trials, +7; was presented upon one- 
quarter of the trials, and —r; was 
presented upon one-quarter of the 
trials. Only two responses were 
available to the S: “‘stubbier’’ (for 
r.) and “‘thinner”’ (for r;, irrespective 
of the direction of the correlation). 
Each procedure was examined at 
three durations: 10, 100, and 1000 
msec. Table 2 presents the results 
of Procedures A: B, C, and D of Series 
V. The first column presents the 
noise burst duration, in milliseconds. 
The second column represents inter- 
polated identification levels obtained 
from appropriate psychometric func- 
tions. The third, fourth, and sixth 
columns present the r, required for 
the stated level of identification for 
Procedures A:B, C, and D, respec- 
tively. (The initial decimal point 
has omitted.) For example, 
Table 2 says that for 100-msec. noise 
bursts in Procedure C, +r’ of .045 
when paired with —7r? of .045 yielded 
75% correct identifications. The 


been 


ratio between the corresponding en- 


tries for Procedures A:B and C is 
presented in the fifth column. 

Table 2 makes three points: (a) 
Procedure C (positive vs. negative 
correlation) yields the highest sensi- 
tivity in that the required correla- 
tional differences are substantially 
lower than obtained with the other 
procedures. (b) The ratio of the 
correlations (squared) yielded by Pro- 
cedures A:B and C in the fifth column 
of Table 2, is nearly 2 to 1, especially 
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TABLE 2 


SUMMARY OF RESULTS WITH POSITIV! 
AND NEGATIVE CORRELATIONS 


Procedure 


Duration 


Msec. Corr. 


10 
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= 
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Se 2 2 sr 
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~ 


1000 


099 
075 
056 
040 
024 
013 
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for intermediate levels of identifica- 
tion. This result suggests the cor- 
relation (squared) measure is an 
appropriate measure of the correla- 
tional variable in that it is additive 
about zero correlation. (c) Uncer- 
tainty of direction of correlation as- 
sociated with Procedure D has a 
small, but significant, effect in re- 
ducing accuracy of identification. 


Series VI: Effect of Sampling Pulse 
Frequency 


Series I-V considered a correla- 
tional display in which a continuously 
moving electron stream defined the 
distribution. Series VI-VIII con- 
sidered the equivalent noise distribu- 
tions but with the trace visible only 
in terms of discrete dots spaced in 
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Fic. 5. Effect of discrete pulsing fre- 
quency upon identification of two correlations. 
The abscissa is the pulsing frequency, in 
pulses/second. The ordinate is the C’/A 
ratio in decibels required for 75% correct 
identification from each of three reference 
C/A ratios (listed as the parameter). For 
comparison, the results of Table 1 for the 
nonpulsed display of 1000 msec. are presented. 
The duration of the display was 1000 msec. 
Tests of Series VI. 


time. This. was accomplished by 
briefly pulsing the z axis of the oscillo- 
scope. The number of samples from 
the distribution is equal to the number 
of pulses delivered to the z axis within 
the available display interval. The 
number of samples is the product of 
the pulsing rate (impulses per second) 
and the available display duration 
(in seconds). In terms of a statistical 
hypothesis tester, S’s performance 
would be expected to improve as the 
size of the sample is increased. 

Figure 5 presents the effect of 
varying the pulsing rate with a dis- 
play of available duration constant 
at 1.0 sec. For comparison, results 
under corresponding nonpulsed con- 
ditions are also presented from Fig. 2. 

Performance improves substantially 
with higher pulsing frequencies. 


Series VII: Effect of Sampling Pulse 
Number 


The total number of samples, as 
represented by the total number of 


pulses per observation interval, is 
determined by the product of the 
pulsing frequency and the available 
duration of the display. Which of 
the three parameters—number of 
pulses, pulsing frequency, or avail- 
able duration of the display—is the 
critical determinant of performance ? 

Figure 6 presents the results of 
tests carried out over a wide range 
of available display durations and 
pulsing rates. The abscissa is the 
available duration of the display; 
the ordinate is the C’/A ratio in 
decibels required for 75% correct 
identification; the parameter is the 
number of samples (pulses) within 
the display duration. For purposes 
of comparison, the corresponding re- 
sults for the nonpulsed display from 
Fig. 2 are also presented. 
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Fic. 6. Effect of number of discrete 
samples upon identification of two correla- 
tions. The abscissa is the display duration 
in milliseconds. The ordinate is the C’/A 
ratio required for 75% correct identification 
from a C/A ratio of 0 db. The parameter 
is the number of samples produced by pulsing 
the z axis of the oscilloscope. For comparison, 
the results of Table 1 for the nonpulsed 
display are presented as the dashed curve. 
Tests of Series VII. 
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Over a wide range of conditions, 
performance is nearly invariant with 
a constant number of samples per 
presentation.‘ The notable exception 
is the rapid deterioration of discrimi- 
nation for displays with a_ small 
number of pulses. With this excep- 
tion in mind, we conclude that 
identification of pulsed correlational 
displays is, in large part, independent 
of the display duration and pulsing 
rate, except insofar as these factors 
determine the total number of sampling 
pulses. 


Series VIII: Effect of Sampling Pulse 
Duration 
The 


was 


number of sampling pulses 
the critical 
determinant of performance in Series 
VII. However, it may be noted that 
the effect of sampling pulse number 
is confounded by the sampling pulse 
duration. (The total sampling dura- 
tion is the product of the total number 


considered to be 


of sampling pulses by the duration 
per sampling pulse.) In 
test for confounding, additional iden- 
tification tests were carried out with 
sampling pulse durations from 2 to 80 
usec with sampling pulse numbers of 


order to 


3 to 80 pulses per display. It may be 
noted that, under the longer sampling 
pulse durations, large segments of the 
correlated 100-6800 cps wave forms 
were obtained rather than the discrete 
dots observed with the shorter pulsing 
durations. 

Over the range of pulse durations 
examined, equivalent performance was 
observed for a number of 
sampling pulses, irrespective of the 
sampling pulse duration. This result 
lends support to VII which 
concluded that the number of sam- 


constant 


Series 


‘The rapid stabilization of the correlation 
between 4 and 36 sampling pulses is illustrated 
by Licklider and Dzendolet (1948, Fig. 2C, 
p. 122). 
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pling pulses was the critical deter- 
minant of performance. 


DISCUSSION 


The deterioration in the identification 
of correlational scatterplots with reduc- 
tion in (a) display duration, (5) noise 
bandwidth, and (c) pulsing number, 
all lend support to the interpretation 
of S’s task as that of testing alternative 
statistical hypotheses. 

In this light, the deterioration of 
performance is not to be attributed to a 
change in S’s performance, but rather to 
a reduction in the reliability of the 
statistical information available to the 
display. Under comparable conditions, 
the performance of an ideal statistical 
observer would also suffer. Supporting 
evidence that statistical sampling of the 
stimulus was often a controlling factor 
upon performance is furnished by the 
extremely high inter-S agreement with 
short nonpulsed samples or with a small 
number of pulses. Under these circum- 
stances, all Ss were often in perfect 
agreement in selecting between the two 
correlations and often incorrectly. 

We have examined several models 
based on the sampling distribution of the 
correlation coefficient. To 
have singularly 
model yields satis- 
factory quantitative predictions of the 
data. We suspect that the 
theoretical development of a performance 
model for correlational distributions for 
the ideal statistical observer will point 
the way to the required model of the 
human observer. 


date, we 
been unsuccessful in 
obtaining a which 


strongly 


One final point: it is an interesting 
exercise to inquire about an appropriate 
representation of the correlational vari- 
able within a _ discrimination 
Distance in discrimination 


space. 
space is 
terms of the 


scaled in confusability 


among the items: the greater the con- 
fusability between two conditions, the 
representation in dis- 


closer is their 


crimination space. This approach sug- 
gests that we might profitably search 
for an appropriate metric of confusability 
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such as the d’ measure of the theory of 
signal detectability. 

An alternative approach may be 
employed for two special cases. If the 
absolute value of correlations required 
for a given level of identification |r;| 
is identical in the testing of (+7; vs. r. 
or zero correlation); (—7r; vs. 7); and 
(+r; vs. —ri), we have three points 
equidistant in discrimination space: +7;, 
—r;,andr,. The appropriate representa- 
tion is two discrimination axes, labeled 
+r; and —r; terminating at r; and 
separated by an angle of 60°. This 
result is, in fact, obtained with interaural 
auditory noise correlations. Direction 
of correlation may be discriminated, but 
its effects are only partially additive. 

Another special case is observed when 
the effect of the direction of correlation 
is completely additive. That is, with 
respect to an appropriate metric, we 
need twice the value of the metric re- 
quired for (+7; vs. ro.) or (—r; vs. fo) as 
for (+r; vs. —r;). In the present exami- 
nation of the visual correlational scatter- 
plot, 72 appears to be the appropriate 
metric, at least for intermediate identi- 
fication levels. The representation of 
the visual scatterplot in discrimination 
space is, then, simply a single discrimina- 
tion continuum, scaled in terms of 7’, 
and symmetrical about rp. 


SUMMARY 


Visual correlation scattergrams were ob- 
tained by mixing a common noise source, C, 
with independent noise sources, A and B, 
and displaying the mixtures (C + A) and 
(C + B) across the x and y coordinates of 


The correlational distribu- 
tions were displayed upon the oscilloscope 


an oscilloscope. 
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either (a) by a continuous beam of electrons 
or (b) by sampling the beam to obtain dis- 
crete samplings of the display. The task of S 
was to identify whether the reference correla- 
tion or a higher correlation was presented. 
The variables examined were the duration, size, 
and sampling characteristics of the display 
and the upper cutoff frequency of the noise 
distributions. 

In terms of the magnitude of the difference 
between the reference and variable correla- 
tions required for 75% correct identification: 
(a) Sensitivity is greatest for the highest 
reference correlations. (6) Sensitivity de- 
creases with narrower bandwidths and shorter 
display times, with rapid changes below burst 
durations of 100 msec. (c) Sensitivity for the 
sampled discrete pulsed display is determined 
by the number of sampling pulses, and is 
independent of the pulsing rate and duration 
of the display, except insofar as they deter- 
mine the number of sampling pulses. 

The results are interpreted in terms of the 
task of S as a tester of alternative statistical 
hypotheses under conditions of varying 
reliability of the display information. 
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TASK DIFFICULTY AND DRIVE IN VERBAL LEARNING 
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In view of the wide disparity in the 
results of experiments on the effect 
of variation in drive strength in 
verbal learning (e.g., Jensen, 1958) 
the following study was made in an 
attempt to test the basic essentials 
of Spence’s theory (Spence, 1958) 
under the least equivocal conditions. 
A situation was first provided in 
which the correct response was clearly 
dominant, the prediction being that 
the performance of high-drive Ss 
would be superior to that of low-drive 
Ss with medium-drive Ss occupying 
an intermediate position. It was 
thought that some previous £s in 
their selection of material to be 
learned may have merely demanded 
responses which were relatively easy 
to acquire without ensuring the actual 
dominance required by the theory. 
For example, if a “high association” 
or easy response is given on 20% 
block of trials then it must 
have been dominated at least momen- 
tarily by some incorrect response on 
80% of the block of trials. Thus 
an increase in drive strength might 
impair the performance of this ‘“‘easy”’ 
response as much as that of a “‘difh- 
cult’”’ response given on only 10% 
of the same block of trials. 
ent interpretation therefore favors 
clear dominance of the correct re- 
sponse rather than relative ease or 
high association value if the per- 
formance of a high-drive group is to 
be superior. 

A second type of situation was 
devised to ensure that an incorrect 
response was clearly dominant at the 
outset of acquisition, the prediction 
in this case being that the perform- 


of a 


The pres- 


ance of low-drive Ss would be superior 
to that of high-drive Ss up to the point 
where the clear dominance of the in- 
correct response was lost, when there 
would be a reversion to the first type 
of situation. As before, medium-drive 
Ss were expected to occupy an inter- 
mediate position. Judging by the 
results of some other experiments 
(e.g., Spence, 1956) in which low- 
drive Ss have performed better on 
low association-value or ‘‘difficult”’ re- 
sponses, this second type of situation 
is more readily obtainable. Theabove 
two predictions are most simply taken 
together as involving an interaction 
between task difficulty and drive level. 


METHOD 


Subjects.—The Ss were 63 students taking 
courses in psychology or medicine at the 
University of Sydney. Three Ss obtained a 
“lie score’’ of 7 or more on the index of drive 
level, the Taylor Manifest Anxiety Scale 
(MAS) (Taylor, 1953), and were rejected. 
The remaining 60 Ss were each allocated 
to one of three drive groups on the basis of 
their anxiety score on the MAS. The 20 low 
scorers (2-11) constituted the low-drive 
group (Mean = 7.5, Mdn. = 7.5, SD = 2.4), 
the 20 high scorers (20-33) the high-drive 
group (Mean=24.8, Mdn.=24.0, SD =3.4) 
and the remaining 20 Ss made up the medium- 
drive group (Mean = 15.5, Mdn. = 15.5, 
SD =2.2). 

Learning materials.—A list of 10 easy paired- 
associates was constructed by using common 
associates in the Kent-Rosanoff list (Rosan- 
off, 1938). To reduce intralist confusions 
no stimulus word was used which had been 
given as a response at any point in the Kent- 
Rosanoff study; the response word in each 
case was the answer most commonly given in 
free association. To ensure clear dominance 
of the correct response, performance on this 
easy list was not investigated until each 
S had given the correct response on two 
successive trials and no measures of perform- 
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ance (response speed) were used prior to this 
point. A list of 10 difficult paired-associates 
was constructed by taking the stimulus words 
of the easy list and pairing them with re- 
sponse words not given in free association 
according to the Kent-Rosanoff list. To 
ensure clear dominance of the incorrect 
response all Ss learned both lists, with the 
difficult list following the easy list. 
Procedure.—The items were presented in a 
Ranschburg exposure apparatus with ap- 
proximately 30 sec. between trials. The S 
was allowed 4 sec. to anticipate the response 
with the stimulus word present (in blue), 
and the response word (in red) was then 
presented for 4 sec. To reduce intralist asso- 
ciations the items of each list were presented 
in three orders, repeated as necessary. An 
electric commutator actuated the exposure 
apparatus and started an electronic chrono- 
scope simultaneously with the presentation 
of each stimulus word. The chronoscope was 
stopped by E pressing a key as soon as S gave 
the correct response or else it was reset after 
4 sec. in readiness for the next stimulus. The 
easy list was presented to each S first, to the 
criterion of two successive errorless trials 
followed by eight further trials. Then the 
difficult list was presented until the same 
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100F EASY LIST 
90 


MEAN RECIPROCAL LATENCY X 10° 
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criterion had been reached. 
answered the MAS items. 


Finally, each S 


RESULTS 


The results obtained with speed of 
correct response (Reciprocal Latency 
X 10°) as a measure of performance 
are presented in Fig. 1. The scores 
plotted for performance on the easy 
list are taken from the first two error- 
less trials and the following eight 
trials, all of which are labeled 
Trials 1-10. The scores for the diff- 
cult list are taken from two stages of 
learning: (a) the first 10 trials (la- 
beled also as Trials 1-10) and (0) 
the first two errorless trials and the 
following eight trials (labeled 
Trials A-J). It should bé noted that 
the mean for the first two 
trials on the difficult list are spuri- 
ously depressed through the use of 
an arbitrary maximum response time 
of 4 sec. 


as 


as 


scores 


—=@ 


DIFFICULT LIST 


” 


HIGH DRIVE e—o 
MEDIUM DRIVEo-——o 


LOW DRIVE 
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B 
TRIALS 


Mean response-speed early (Trials 1-10) and late (Trials A-J) in learning. 
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Trials 1-10 as defined above pro- 
vide the primary data of the experi- 
ment. These data are in accord with 
the prediction for the easy list, with 
the high-drive group superior in 
performance to the low-drive group 
throughout and the medium-drive 
group falling between these extremes. 
On the difficult list the prediction 
is confirmed to the extent that the 
low-drive group shows superior per- 
formance to the high-drive group 
on all trials except the first, but there 
is a deviation from prediction in that 
the high-drive Ss perform at a con- 
sistently higher level than the me- 
dium-drive Ss when the opposite 
result was expected. The results of 
an analysis of variance in these data 
are summarized in Table 1, homo- 
geneity of variance having been estab- 
lished. The predicted interaction is 
statistically significant. 

A further the general 
theory under consideration is avail- 
able in the data for the difficult list 
over Trials A-J as defined above and 
in a comparison of these data with the 
scores from Trials 1-10 for the same 
list. When correct responses are 
given on the difficult list it takes on 
the character of the easy list, leading 
to the prediction that high-drive Ss 
should now perform at a superior level 
despite their previous inferiority. 
That this effect does occur is shown 


test of 


TABLE 1 


\NALYSIS OF VARIANCI 
RESPONSI 


IN SPEED Of! 
SCORES OVER TRIALS 


Source df WS I 


Task difficulty (A 

Drive level (B) 3.44* 
AXB 5,655. 12.99** 
Ss 205. 6.90** 
Measures 


*P < 05 
*P < 001. 
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TABLE 2 


ANALYSIS OF VARIANCE IN SPEED OF RE- 
SPONSE SCORES ON THE DIFFICULT 
List IN EARLY AND LATE 
STAGES OF LEARNING 


Source df MS 


Stage of learning 


(A) 


d 1) 183,300.84) 228.85** 
Drive level (B) 2 
) 


4,838.81) 6.04* 

10,117.41) 12.63** 
800.95, 5.54** 
144.51 


\xB 
Ss 114 
Measures 960 


oP << mi 
P< 001. 


in Fig. 1 although now the low-drive 
group is slightly superior to the 
medium-drive group at most stages 
whereas it would have been expected 
to be inferior in performance. The 
effect now under consideration in the 
comparison of performance on the 
difficult list at two stages of learning 
involves an interaction similar to 
that already considered in Table 1. 
This was tested in a second analysis of 
variance, summarized in Table 2. 
The interaction is statistically signifi- 
cant and the second analysis confirms 
the finding in the primary data. 

A final test of the theory was made 
with two other measures of per- 
formance available in the acquisition 
of the difficult list, 


percentage fre- 
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MEAN PER CENT 
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iL i J 


hi Sa on, F 8. S-B 
TRIALS 





Fic. 2. Mean percentage frequency of 
correct response for the difficult list early 
in learning. 
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quency of correct response and num- 
ber of trials to the criterion. The 
result using the first of these measures 
over Trials 1-10 is plotted in Fig. 2, 
wherein it may be observed that the 
relative disposition of the low-, me- 
dium-, and high-drive groups deviates 
only slightly from that predicted. 
The mean trials-to-criterion scores 
for the low-, medium-, and high-drive 
groups were 7.86, 12.25 and 14.85 


trials, respectively, and exactly in 
accord with the prediction. 


DISCUSSION 


The results of this experiment are 
clearly compatible with the theory 
under consideration, presumably because 
it was conducted as nearly as possible 
within the specified boundary conditions. 
It would therefore seem that the effect 
of other methods of varying task dif- 
ficulty in verbal learning, e.g., through 
intralist similarity, should not necessarily 
be accepted as support for or disproof 
of the theory in the present state of 
knowledge but that an attempt should 
rather be made to extend the theory 
to account for more complex circum- 
stances. Some features of the present 
study suggest factors which might be 
taken into account in this endeavor. 

First, the equivocal results of the 
medium-drive group deserve attention. 
These Ss fell with the high-drive group 
when the incorrect response was domi- 
nant (difficult list, Trials 1-10) and with 
the low-drive group when the correct 
response was dominant (easy list, Trials 
1-10 and difficult list, Trials A-J) where- 
as it had been expected that they would 
occupy an intermediate position through- 
out. The reason for this finding may 
lie in the nature of the MAS as a measur- 
ing instrument in need of further 
refinement. 

Secondly, there appear to be some 
discrepancies in the data according to 
the measure of performance used. With 
speed of correct response as an index 
of response strength the performance 
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curves for the acquisition of the difficult 
list cross at an intermediate value of 
about 60 units on the scale used (by 
interpolation in Fig. 1). With percentage 
frequency of correct response as a meas- 
ure, however, no such crossing was ob- 
served (Fig. 2). An additional phenom- 
enon is the drop in the response speed of 
the low-drive group on the difficult list 
at about the stage of the first four error- 
less trials (Trials A-D, Fig. 1) whereas 
the other two groups continue to im- 
prove; this finding was not apparent 
with the other measure of performance. 
These discrepancies are almost cer- 
tainly due to the fact that response speed 
is a more sensitive measure and one 
which does not present an arbitrary 
maximum level, but it is possible that 
percentage frequency of correct response 
involves some complication as yet not 
made explicit. 

Finally, whereas performance on the 
easy task at Trial 10 when averaged over 
the three drive groups is 98.3 units on 
the scale of response speed, that for the 
difficult task at Trial J is only 73.7 units 
on the same scale (Fig. 1). The similar 
disposition of the high- and low-drive 
groups at these points, as well as the use 
of the same criterion of performance, 
would suggest that the absolute level of 
performance should be the same in the 
two cases and the obtained difference 
was not anticipated. A reason for 
expecting a difference may be found 
after the event in the possible inhibiting 
effects of nonreinforcement (Spence, 
1956) which might carry over from 
Trials 1-10 to Trials A-J in the case of 
the difficult list but which could not 
be generated in Trials 1-10 with the easy 
list. 


SUMMARY 


Situations were sought in which the basic 
implications of Spence’s theory about the 
interaction of task difficulty and drive in 
verbal learning could be tested in the least 
equivocal fashion. An easy list in which 
the correct response clearly dominated was 
obtained by measuring speed of response 
over a series of errorless trals and a difficult list 
involving clear dominance of the incorrect 
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response was achieved by having Ss then 
learn new responses to the stimuli of the easy 
list. With score on the Taylor MAS as an 
index of drive level, high- and low- drive 
groups performed as expected, with the high- 
drive group superior on the easy list and in- 
ferior on the difficult list but medium-drive 
Ss failed to occupy the predicted intermediate 
position with any consistency. The nature 
of the MAS as a measuring instrument, the 
index of response-strength used, and inhibi- 
tion due to nonreinforcement were indicated 
as three sper ific factors contributing to dis- 
parity in the results of experiments in this 
area as well as general failure to keep within 


the boundary conditions of the type of theory 


being tested. 
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ASSOCIATION VALUE AND FAMILIARITY IN 
SERIAL VERBAL LEARNING! 


RICHARD H. LINDLEY 


Carnegie Institute of Technology 


A well known generalization about 
verbal learning is that serial lists 
of high association value (HAV) non- 
sense syllables are easier to learn 
than lists of low association value 
(LAV). This generalization holds true 
under fairly varied conditions; e.g., 
when the methods of complete presen- 
tation and unaided recall are used 
(McGeoch, 1930) or when the syl- 
lables are spelled and presented seri- 
ally on a memory drum (e.g., Under- 
wood & Richardson, 1956). 

Analysis of lists of nonsense syl- 
lables in which association values 
were determined (Glaze, 1928; Hull, 
1933; Krueger, 1934) shows that 
some of the syllables are familiar 
since they are the first three letters 
of English words (e.g., BEL-BELL; 
FAS-FAST). Ninety-seven per cent 
of the 100% association value syl- 
lables in Glaze’s lists are the first 
three letters of one or more of the 
20,000 most common words in the 
English language as determined by 
Thorndike and Lorge (1944). The 
comparable figures for the 80% and 


0% association value syllables are 

© 
70% 
differences in 


and 0%, respectively. Thus 
the learning 
HAV and LAV syllables may simply 
be due to differences in the familiarity 
of these kinds of items. 


rate of 


The present study compared the 


1This paper was presented at the Mid- 
western Psychological Association Convention 
in 1959. The research was supported in part 
by a grant from the Humanistic and Social 
Studies Division of Carnegie Institute of 
Technology. Thanks are due Mary Ann 
Turnquist for clerical assistance. 


rates of learning of lists of nonsense 
syllables which had the same pub- 
lished association values but which 
differed appreciably in familiarity. 
If the relationship between associa- 
tion value and rate of learning is 
due to familiarity, the familiar syl- 
lables should be easier to learn than 
the unfamiliar 
association value. 

A second implication of the famili- 
arity hypothesis is that nonsense 
syllables should be no more difficult 
to learn than meaningful words pro- 
vided that both kinds of items are 
equally familiar. McGeoch (1930) 
found that words were easier to learn 
than HAV syllables; however, no 
control over familiarity was reported 
in that study. The present study 
controlled for familiarity. 


ones regardless of 


METHOD 


Experimental design.— There were four 
main groups of Ss. Group I learned a list 
of meaningful words; Group II 
familiar HAV syllables; Group III 
unfamiliar HAV syllables; and Group IV 
learned unfamiliar LAV syllables. The study 
consisted of two replications; the first used 
syllables drawn from Glaze’s lists and the 
second used syllables drawn from Krueger's 
lists. Twelve Ss were assigned to each group 
in each replication, making a total of 24 
Ss in each main group. 

In the first replication the meaningful 
words were constructed by adding letters 
to the familiar HAV syllables to form words 
(e.g., FOL-FOLD). The results showed almost 
no difference in the rate of learning of the 
familiar HAV syllables and words. The failure 
to obtain a significant difference could have 
been due to the fact that the words were 
longer than the syllables; therefore, Ss in 
Group I in the first replication were replaced 
with Ss who learned words of the same length 


learned 
learned 
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TABLE 1 


Lists 


Words 


(Repl. 1 and 2) (Repl. 1) 


Group 1* Group 2* 


Group 3 


LIP 
TAX 
DOG 
RED 
CUT 
MAN 
FEW 
yJoY 
HIS 


RUJ 
WIM 
BOC 
DEY 
LAQ 
VIF 

TAZ 

NOK 
HUP 


WID 
FOL 
yac 
KEP 
BIR 
HOM 
SUG 
NEX 
GAV 


Glaze's Syllables 


UsEp 


Krueger’s Syllables 
(Repl. 2) 


Group 4 
LEQ 
cij 
WUB 
ZOM 
DAX 
NEH 
VAF 
YOP 
GUK 


Group 2* Group 3 
LUV 
QIT 
ROX 
WUN 
DAC 
YEZ 
KOS 
JEM 
FAX 


Group 4 
WES 
BUR 
PIC 
GOV 
NAM 
JUL 
DIF 
YEL 

TAK 

| 


YUT 
BIW 
GOX 
DAJ 

Z1IQ 

FAP 
KEC 
VUH 
MOJ 


* All items for Groups 1 and 2 occurred 100 times per million or more in the Thorndike-Lorge count except 


TAX, JAC, and jr 


as the nonsense syllables. The latter Ss 
were run during the second replication. 

Materials.—Familiarity was estimated by 
using the G or General count of the Thorndike 
and Lorge (1944) word count. The familiar 
syllables were the first three letters of English 
words which occurred 50 times per million 
or more in English text. The unfamiliar 
syllables were not the first three letters of 
any words occurring one or more times per 
million. Thirteen of these syllables were not 
parts of any words which occurred 10 times 
per .million or more; five of the syllables 
occurred once or twice in positions other than 
the first three letters in words which occurred 
10 times per million or Thus 
familiar HAV syllables were clearly more 
familiar than the unfamiliar HAV syllables. 
The meaningful words were selected from 
three-letter words in the Thorndike-Lorge 
count which occurred 50 times per million or 
more. 


more. the 


Each list contained nine items. The items 
were selected with the restriction that formal 
intralist similarity be low. No consonant 
letter appeared more than twice in any one 
list. The items are presented in Table 1 
in the orders in which they were learned by 
Ss. The mean association values and ranges 
of Glaze’s syllables for Group II were 84.6% 
and 67%-100% respectively; for Group III, 
84.6% and 67%-100%; for Group IV, 7.4% 
and 0%-27%. The corresponding 
of Krueger’s syllables for Group II were 
94.5% and 85.5%-99%; for Group III, 
94.4% and 89.5%-96% ; for Group IV, 49.4% 
and 34%-60%. 

Procedure. 


values 


The items were presented on a 
memory drum at a 2-sec. rate. An asterisk 


L which occurred 50 times per million but less than 100 times. 


served as a cue for the first item. There was 
a 6-sec. pause between trials. Typical in- 
structions for learning by the anticipation 
method when the items are spelled were read 
to S at the start of the experiment. The lists 
were learned for 25 trials and the mean num- 
ber of correct responses occurring during 
these trials constitute the major data of this 
study. 

At the termination of serial learning, S was 
instructed to report any meaningful associa- 
tions he had made to the nonsense syllables 
during serial learning. The E spelled the 
syllables one at a time and S reported any 
associations he had made. 

Subjects.—The Ss were 96 undergraduate 
men and women from Carnegie Institute of 
Technology. None of the Ss had had previous 
nonsense syllable learning experience. The 
Ss were assigned to groups according to a 
randomized block design. 


RESULTS 


Serial learning.—The means and 
SDs for the mean number of correct 
responses during the 25 serial learning 
trials are presented in Table 2. 
Analysis of variance showed that 
only the differences between groups 
were significant (F = 19.90, df = 3 
and 66, P < .01). Since the F for 
replications was not significant, the 
means for the individual groups 
were pooled and the differences be- 
tween the main groups were tested 
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TABLE 2 


MEANS AND SDs For NUMBER OF CORRECT 
RESPONSES IN 25 TRIALS 


Replication 1 Replication 2 


Groups 


Mean SD | Mean 


169.25 

151.83 

123.25 
83.00 


I 166.25 | 15.69 
II 45.43 
| oe 37.80 
IV | 103.67 


30.95 


with the ¢ test; a difference of 22.01 
is significant at the 5% level. Thus 
all differences reached P = .05 except 
that between Groups I and II. 
Inspection of Table 2 shows that 
there is a trend for the meaningful 
words to be easier to learn than the 
familiar HAV syllables. Further anal- 
ysis of the data showed that four Ss 
in Group II failed to learn the list 
to a criterion of one perfect recitation 
within 25 trials whereas all Ss in 
Group I learned the list within 25 
trials. When medians are compared, 


— 


Pa 
nails 


—~. 


Mean Numper Correct 
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TABLE 3 
MEAN NUMBER CORRECT PER TRIAL FOR 
SUCCESSIVE BLOCKS OF FIVE TRIALS 
FOR THE MAIN GROUPS 
Trial Blocks 
Groups 


11-15 16-20 


7.33 
6.61 
4.80 
3.80 


8.30 
7.28 
6.18 
4.98 


the difference between Groups I and 
II disappears. (The median number 
correct was 166 for Group I, 163 for 
Group II, 124.5 for Group III, and 
93.5 for Group IV.) In 
the number of trials to reach a 
criterion of one perfect recitation, 
the mean number of trials was 11.83 
for Group I (N = 24) and 11.65 
for Group II (N = 20). Thus, for 
the majority of Ss, the familiar HAV 
syllables were no more difficult to 
learn than the meaningful words. 


terms of 


o«—— -«Garoup I 
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* « Grove IIL 
«———-« Group lV 


—_ 


- 
ys 





Seriat Position 


Mean number correct in 25 trials as a function of serial position. 
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TABLE 4 
MEANS AND SDs ror NUMBER 
OF ASSOCIATIONS 


Replication 1 Replication 2 


Groups 


Mean Mean SD 


1.78 
1.65 
2.16 


7.00 
5.67 


4.00 


II 6.33 
Ii! 4.33 
IV 4.08 


The mean number of correct re- 
sponses per trial for successive blocks 
of five trials for the main groups are 
presented in Table 3. The curves 
for the high familiarity items are 
negatively accelerated and asymp- 
totic to the x axis while the curves 
for the low familiarity items ap- 
proximate straight lines. This may 
simply reflect the fact that the high 
familiarity items were better learned 
during the 25 trials than the low 
familiarity items. Figure 1 presents 
serial position curves for the four 
main groups. Skewed, bowed serial 
position curves which typical 
of learning disconnected verbal ma- 
terial were obtained for all groups. 
The curves for the high familiarity 
items are less bowed than the curves 
for the low familiarity items. 

Number of associations.—The means 
and SDs for the number of associa- 
tions reported by Ss are presented in 
Table 4. Analysis of variance showed 
that only the differences between 
groups were significant (F = 9,99, 
df = 2 and 44, P = .01). The ¢ test 
shows that a difference of 1.23 asso- 
ciations between groups will be signifi- 
cant at P = .05. the 


are 


Thus only 
differences between Groups I and III, 
and Il and IV reach an acceptable 
level of significance. 


DISCUSSION 


The results clearly show that famili- 
arity is an important variable in serial 
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nonsense syllable learning. Even though 
the published association values of the 
syllables learned by Groups II and III 
were equal, the familiar syllables were 
significantly easier to learn than the 
unfamiliar syllables. Furthermore, when 
familiarity was controlled, the familiar 
syllables were no more difficult to learn 
than the meaningful words for most Ss. 

The results showed that there 
was a direct relationship between ease 
of serial learning and the number of 
associations evoked by the _ syllables, 
although only the differences in associa- 
tions between the familiar and unfamiliar 
syllables reached the .05 level of signifi- 
cance. 


also 


These results are consistent with 
the theories of Mandler (1954), and 
Dollard and Miller (1950) which predict 
that items which elicit distinctive im- 
plicit responses will be easier to learn 
than items which do not elicit such 
distinctive The Ss in the 
present study reported more associations 
(distinctive responses) to the familiar 
syllables than they did to the unfamiliar 
syllables. The fact that the unfamiliar 
HAV syllables were easier to learn than 
the LAv syllables can be explained by 
this reasoning. That is, the unfamiliar 
HAV syllables elicited more associations 
than the LAV syllables and thus may have 
been more distinctive. 

The results of this study are consistent 
with Hall (1954). 
In that study familiar words were found 
to be easier to recall than unfamiliar 
ones. Thus familiarity is an important 
variable in the learning of both nonsense 
syllables and words. 


responses. 


results obtained by 


SUMMARY 


The present study tested the hypothesis 
that the relationship obtained between pub- 
lished association values and ease of serial 
learning was due to familiarity. The results 
showed that lists of familiar syllables were 
easier to learn than lists of unfamiliar syllables 
when the items were spelled, even though 
the average published association values 
for the lists were identical. Furthermore, 
no significant difference was obtained between 
the rate of learning of meaningful words 
and nonsense syllables when familiarity was 
equated. The average number of associa- 
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tions evoked by the syllables during learning 
was directly related to ease of learning. The 


results were interpreted as being consistent 
with theories which emphasize the role of dis- 
tinctive implicit responses in verbal learning. 
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EFFECTS OF PROBABILITY OF MISINFORMATION 
AND NUMBER OF IRRELEVANT DIMENSIONS 
UPON CONCEPT IDENTIFICATION ! 


VLADIMIR PISHKIN? 


University of Utah 


The primary aim of the present 
experiment was investigate the 
effect of incorrect, or misinformative, 
feedback (MF), in conjunction with 
various degrees of task complexity, 
upon the performance of human Ss 
in concept identification. 

Variation 


to 


of misinformative feed- 


back, MF, in this experiment, was 


an operation similar to partial rein- 
forcement in a two-choice guessing 
problem, where asymptotic perform- 
ance typically probability 
matching. The present experiment 
differed from guessing game studies 
(e.g., Grant, Hake, & Hornseth, 1951; 
Humphreys, 1939) in the following 
respects: (@) percentage 
feedback for either of the 
sponses depended upon 
the stimulus, 
(6) stimulus information irrele- 


reat ‘hes 


correct 
two 


of 
re- 
only one 
binary dimension of 
and 
vant to either response was introduced 
It was 
hypothesized that the introduction of 
irrelevant 


in a quantifiable manner. 


information would reduce 
rate of learning. 
in 


Despite differences 
methodology, it was expected 

1 This article is based on a dissertation sub- 
mitted to the Department of Psychology of 
the Graduate School of the University of 
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ments for the PhD degree. The author is 
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for his guidance and assistance throughout 
the course of this investigation. This research 
was supported, in part, by a research grant 
(M-1759): from the National Institute of 
Mental Health, United States Public Health 
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371 


that probability matching would take 
place in this experiment as in other 
partial reinforcement studies. Morin 
(1955) has shown that the acquisition 
of correct responses in a two-choice 
discrimination situation is negatively 
and progressively influenced by MF. 
Morin’s data suggest that various 
dégrees of MF in a concept identifica- 
tion situation would produce differ- 
ential acquisition rates of correct 
responses. 

Estes and Straughan’s (1954) model 
demonstrated that with the exception 
of a few deviant cases, terminal 
response probability approached theo- 
retical asymptotic response proba- 
bility, #, the probability of reinforce- 
ment for that response. Restle (1955, 
1957) has proposed a model similar 
to Estes and Straughan’s except that 
the proportion of relevant stimulus 
cues in a problem determines the 
rate of learning. This theory assumes 
a population of available stimuli con- 
sisting of sets of cues which are either 
relevant or irrelevant to the prob- 
lem’s solution. Formally, 6 = r/r + 2, 
where r is number of relevant cues, 7 
is number of irrelevant cues, and 6 
is the learning rate. By assuming 
that over a series of trials all irrelevant 
cues will be adapted and thus ren- 
dered ineffective, and that proba- 
bility of response equals proportion 
of conditioned, unadapted cues, Restle 
was somewhat successful in predicting 
the outcome of a variety of two-choice 
discrimination problems. In Restle’s 
(1955) framework, the number of 
errors (£,) made before solution is 
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approximated by: 


_ log 6 


(1—6) log (1—8) 


7 ol 1 
E.=3+3 


[1] 
where @ is a rate of learning parameter, 
originally assumed equal to 6. Ina 
later paper, Restle (1957) reasoned 
that rate of learning parameters 
depend on z, the proportion of time 
a relevant cue is reinforced. Further, 
Restle proposed that a relevant cue 
which is reinforced less than 100% 
of the time has validity less than 
unity. The following equation repre- 
sents cue validity, V, as a function of 


TT: 
V=4e —4r +1 [2] 


Then, in the extended model: 


6= Vi [3] 
Since S may not perceive all of the 
cues presented, the parameter 6 must 
be estimated from data. Once this 
has been done, E; should be pre- 
dictable for various conditions in the 
present design. Thus, this study 
serves, in part, as a test of Restle’s 
model. 

The threefold purpose of the present 
experiment was: (a) to determine the 
effect of MF under various degrees 
of complexity, (b) to test the hy- 
pothesis of probability matching, and 
(c) to account for the data in terms of 
Restle’s discrimination model. 


METHOD 


Subjects. —The Ss were 180 volunteer 
students in elementary psychology courses 
who were divided randomly into 45 treatment 
groups of 4 Ss each (2 men and 2 women). 

Task and apparatus.—All Ss were presented 
a series of 192 geometric patterns with a 
Dunning animatic projector. The screen, 
mounted at S’s eye level, was bordered by 
an opaque shield which prevented S from 
seeing other than pertinent apparatus. In 
front of S was a panel with two response keys 
and two amber signal lamps. The S’s task 
was to solve a two-choice discrimination 
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problem by successively matching one or the 
other of two keys to invariant pattern 
dimensions, e.g., depression of the left key 
for the image of a triangle; of the right, for a 
square, etc. The keys were not marked in 
any way. 

Procedure.— At the beginning of each 
session, S was verbally instructed as to the 
nature of the task, the meaning of the feed- 
back lamps, number of trials required, and 
manipulation of the controls. The S began 
his task by viewing a geometric pattern 
projected on the screen. After a self-deter- 
mined interval, S responded by pressing one 
of the two keys. The momentarily depressed 
key initiated a signal which recorded S's 
choice of responses on the chart of an Ester- 
line-Angus operations recorder and triggered 
a cascaded multiple-phase electronic timer. 

Phase 1 of the timer advanced the film- 
strip to a blank frame. Phase 2 activated a 
Western Union tape transmitter precoded 
with a tape which matched the filmstrip 
programming, thereby enabling the informa- 
tion feedback, irrespective of S’s choice, to be 
registered on the Esterline-Angus operations 
recorder. In addition, the same information 
coded on the tape led to the illumination 
of one of the two panel lamps above S’s 
keys for a duration of 1 sec. Phase 3 ad- 
vanced the filmstrip to the next stimulus 
permitting S to begin another trial 5 sec. 
after S’s last response. 

Nine strip-filmed series of patterns were 
used. There were three different problem 
types involving form, size, or number, as the 
relevant dimension. A dimension was termed 
“relevant” when its use by S was requisite to 
identify a pattern with an appropriate key. 
Each problem type was further elaborated 
into one of three specific problems which 
differed in complexity, i.e., amount of irrele- 
vant information. All irrelevant dimensions 
had a zero correlation with correct responses. 
When a dimension was neither relevant nor 
irrelevant, it appeared without variation at 
only one of its two levels within a given series. 

In addition to the basic variations of 
relevant and irrelevant dimensions, some Ss 
performed under a third condition wherein 
they received erroneous feedback. This feed- 
back is interpreted by the E as MF. The 
probabilities that S would receive a “correct"’ 
signal over the 192 trials when he made a 
response actually correct with respect to the 
relevant dimension, were 1.00, .90, .80, .70, 
and .60 as determined by 0, 10, 20, 30, and 
40% of MF, respectively. That is, on some 
occasions when S had made a “correct” 
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response with respect to the relevant dimen- 
sion, the signal lamp indicating a ‘“‘wrong”’ 
response was lighted, deliberately misinform- 
ing S as to his choice of keys. Conversely, 
on some occasions, S received feedback to 
indicate he had responded correctly when, 
in fact, he had not done so. The scheduling of 
MF was determined by use of a table of 
numbers. The order of pattern 
presentations semirandom (i.e., all 
patterns appearing equally often but with 
the restriction that the pattern 
never presented on successive trials 
Design.—A 5 X 3 X 3 X 2 factorial design 
was used which incorporated the five degrees 
of MF (0, 10, 20, 30, or 40% of S’s responses 
met with erroneous feedback over 192 trials), 
3 levels of complexity (1, 3, or 5 irrelevant 
3 problems 


random 
was 


same was 


dimensions), (form, 
large or small; or 
one object or two objects, 


dimension), 


square, or 
number, 
relevant 


triangle ; size, 


ind 2 sex groups. 


RESULTS 

studied 
respect 
dimension 


The dependent variable 
number of errors with 
to the E-defined relevant 


was 


IDENTIFICATION 


This 
score was analogous to the dependent 
variable used in guessing games and 
is not 


over the 192 stimulus patterns. 


the number of errors as seen 
from S’s point of view. 

Main effects —The data were ana- 
lyzed for the following four main 
effects: MF, Complexity, Problems, 
and Sex. The 192 stimuli were 
pooled into 12 blocks of 16 patterns 
each for the convenience of statistical 
and graphic treatment. Owing to the 
visible character of the graphs and 
the differences in learning rate, the 
analysis of variance was performed 
on the last two blocks (32 trials), 
omitting the Blocks variable and the 
0% MF group. Summary of this 
analysis is presented in Table 1. 
Only the main effects of MF and 
Complexity were found to be signifi- 
cant. The increase of mean number 
of errors over the five degrees of MF, 
which was apparently linear, is shown 
in Fig. 1. Orthogonal polynomial 
analysis was performed on this func- 
tion and only the linear component 
reached a significant level (P < .001). 
The significance of Complexity agrees 
with previous work on effects of this 
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Mean errors as a function of 
percentage of MF. 
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variable in concept identification 
(Archer, Bourne, & Brown, 1955; 
Bourne, 1957; Bourne & Pendleton, 
1958; Brown & Archer, 1956). The 
mean numbers of errors for the condi- 
tions of 1, 3, and 5 irrelevant dimen- 
sions for the two blocks were 7.79, 
8.73, and 10.31, respectively. The 
orthogonal polynomial analysis veri- 
fied that only the linear term of Com- 
plexity reached significance (P <.001). 

As expected, the main effect of Sex 
was insignificant. Neither was the 
Problems main effect significant, a 
result inconsistent with those of 
Archer, Bourne, and Brown (1955), 
and Brown and Archer (1956). The 
fact that the same irrelevant dimen- 
sions were used with the three relevant 
dimensions, supports the results of 
Grant and Curran (1952), and Heid- 
breder (1948, 1949), where identifica- 
tion of number and form was found 
to be equally difficult. 

Following the suggestion of Lind- 


quist (1953), graphic presentations of 


interactions were made. The inter- 
action of MF X Complexity indi- 
cated that MF has a greater retarding 
influence on performance as_ task 
complexity increases. Further analy- 
sis of this interaction by orthogonal 
polynomials (Myers, 1959) revealed 
significant linear and quadratic trends. 
Both the linear and quadratic trends 
were broken down into their three 
components. It was shown that both 
main trends have significant quadratic 
components. In addition, the linear 
trend had a significant linear compo- 
nent (Table 1). This indicated that 
the slopes of MF X Complexity curves 
differ significantly on these compo- 
nents. Significance of the MF XK C 
x P XS interaction is difficult to 
interpret. 

An additional analysis of variance 
was performed on Blocks. This 
revealed a significant inverse rela- 
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tionship between number of trials 
and errors (F = 41.62, 11 and 990 df, 
P < .01). The interaction of Blocks 
xX MF was also significant (F = 4.29, 
44 and 990 df, P < .01), indicating 
that Ss performing at the varying 
levels of MF approach different 
terminal levels of performance. 
Probability matching.—Figures 2, 3, 
and 4 represent mean number of 
errors per block for each MF group 
at each level of complexity. The 
obtained asymptotic value for each 
of the four degrees of MF was based 
on the mean number of errors on 
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MEAN ERRORS 


Fic. 4. Mean 
patterns. The 
MF. 


errors per block of 
parameter is percentage 
Five irrelevant dimensions. 


Blocks 11 and 12, subgroups being 
the insignificant Sex 

variables. The ex- 
pected error scores (probability match- 
ing) were determined by the prob- 
ability of MF. Since there were 32 
the last two the 
expected error scores were 3.2, 6.4, 
9.6, and 12.8 for 10, 20, 30, and 40% 
MF conditions, respectively. Differ- 
were calculated between ob- 
tained and expected mean error scores. 
The hypothesis that the average differ- 
ence scores over the three complexity 
groups would be tested. 
Only the Mean source of variance for 
the 40% 


pooled 
and 


over 
Problems 


blocks, 


trials in 


ences 


Zero Was 


MF group was significant 


(P < 01). Only the 30% MF com- 
plexity groups differed significantly 
among themselves in the extent of 
their deviations from the predicted 
value (P < .05). This effect 
linear as indicated by the orthogonal 
polynomial, with performance under 
greater degrees of complexity produc- 
ing more errors than predicted by 
probability matching (Table 2). 

A tabular analysis on number of 
left and right button responses re- 
vealed the same _ trend trials 
throughout the various treatment 
conditions. 

Restle’s model.—In Restle’s theory 
(1957), 6 denotes the proportion of 
relevant cues in the problem. The 
values of 6 for the three levels of 
complexity were estimated from the 
performance of the 0% MF groups. 
Using these 4s, the value of cue 
validity, V, which would cause each 
of the 12 MF X Complexity groups 
to make the mean number of errors 
observed was computed by Equation 
1. Following Restle’s procedure (1957, 
p. 187), the values of V predicted by 
Equation 2 were found and compared 
with the computed Vs. The obtained 
V for each of the 12 conditions was 
subtracted from the predicted V, and 
an analysis of variance of these 12 
difference sce: “s was performed (Table 


was 


over 


TABLE 2 


ANALYSES OF VARIANCE FOR DEVIATIONS FROM EXPECTED MEAN ERRORS 


Source 


Mean 
Complexity 
Linear 
Quadratic 
Error 


12.02 
47.99 
60.36 
33.89 
20.08 


*P < 0S. 
7 P < 01. 
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43.53 
60.17 
26.89 
27.74 


Misinformation Feedback 


F 


9.08** 
35.09 

20.16 

50.00 

22.19 | 


| 
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TABLE 3 


ANALYSIS OF VARIANCE FOR DIFFERENCES 
BETWEEN OBTAINED AND 
PREDICTED Vs 


Source MS F 
Mean 
Cue Validity (V) | 
Linear 
Quadratic 
Cubic 
Complexity (C) 
Linear 
Quadratic 
V X C (Error) 


io 


|.002 | 2.00 

| .061 | 61.00* 
18 | 180.00* 
.00008 | 08 
0007 | 70 
.001 1.00 
| 0005 | 50 
00104} 1.04 
001 | 


—e eee 


a 
o 


°P < 001. 


3). The main effect of V was signifi- 
cant (P < .001), indicating some 
significant discrepancy between pre- 
dicted and obtained Vs. This effect 
was linear as tested by the orthog- 
onal polynomial ; deviations becoming 
greater with increase in Complexity 
and MF. 

One final theoretical analysis con- 
cerns the effect of number of irrelevant 
dimensions. Bourne and Restle (1959) 
assume that each such dimension 
is an equal source of irrelevant cues. 
From this they deduce that 1/6 
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Fic. 5. 1/@7 as a function of number 
of irrelevant dimensions. Estimates based 
on the mean of total errors for each level of 


complexity over combined degrees of MF. 
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is a linear function of the number of 
irrelevant dimensions. Figure 5 is a 
plot of the reciprocal of estimated 67, 
which here should be proportional 
to 1/6, in each of the three complexity 
conditions with all the MF conditions 
combined. The function is apparently 
straight-line in form. 


DISCUSSION 


The results show that the efhciency 
of concept identification is a decreasing 
function of MF. This was expected 
from the results of Bourne and Pendleton 
(1958) in their study of probability of 
information feedback. In that study, 
the omission of feedback on a certain 
percentage of trials significantly reduced 
performance. This suggested strongly 
that MF would have a similar, though 
greater, effect. Consistent with previous 
work on effect of task complexity, the 
present investigation showed that irrele- 
vant information has a quantifiable effect 
on task difficulty. Furthermore, the 
effect of MF depends on the level of 
task complexity, i.e., MF inhibits con- 
cept identification more as task com- 
plexity increases. It was shown that 
the slopes of MF X C curves differed 
significantly on both linear and quadratic 
components, producing decrement in 
performance. 

The hypotheses concerning the effects 
of MF were based, in part, on the results 
of guessing game experiments. Owing 
to the complexity differences, rate of 
approach to asymptote ought to be 
less in the present task, but terminally, 
Ss were expected to match probabilities. 
In the main, these expectations were 
realized. However, it is important to 
note that with greatest MF (40%), the 
number of errors was significantly greater 
than expected by probability matching. 
It is regrettable that more trials were not 
given, since the performance of certain 
groups might have surpassed probability 
matching performance, had further op- 
portunity been provided. Trends of this 
sort were indicated in Fig. 2, 3, and 4. 

In this study the sequence of MF was 
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randomly determined within the 192 
patterns. In other studies, e.g., Morin’s 
(1955) the percentage of MF was dis- 
tributed over blocks of fewer 
thus providing a more homogeneous 
distribution of MF. Further investiga- 
tions of the effects of various distribu- 
tions of MF would probably shed more 
light on the validity of V. It seems 
plausible to expect that a homogeneous 
distribution of MF would render it less 
effective as an inhibitor of performance. 
In relation to other of MF, 
present findings must be interpreted 
with caution. The Ss’ awareness of the 
existence or possibility of MF has a 
significant effect on performance as 
shown by Morin (1955). In this experi- 
ment, some indirect suggestion of the 
presence of MF was made in the instruc- 
tions. It is probable that task difficulty 
could be increased by omitting this 
suggestion. 

Some 


trials, 


studies 


met in the theo- 
retical analysis of data based on Restle’s 
(1955, 1957) discrimination model. The 
@s, or rates of learning, were somewhat 
lower in the present data than is the 
case in simple guessing games (Restle, 
1957). Furthermore, @ tends to decrease 
as task complexity increases, thus indi- 
cating that complexity accounts, at 
least in part, for the difference. The 
relationship between 1/@7 and number 
of irrelevant dimensions linear as 
predicted. Quantitative predictions of 
(Equation 1), were 
grossly divergent from obtained errors 
for the four degrees of MF, and signifi- 
cant difference and 
predicted Vs were This result 
may from First, 
there were several unexpected inversions 
in the obtained error scores, i.e., trends 
in which errors did not consistently 
increase with combined increases in MF 
and Complexity (Fig. 2, 3, and 4). 
Such inconsistencies mediate against 
securing correct estimates of certain 
theoretical parameters, such as 6 and V, 
since these parameters are based on the 
assumption of additivity of irrelevant 
Second, it is possible that Restle’s 


success Was 


was 


errors however, 


estimated 
found. 
two 


between 


stem sources. 


cues. 


concept of cue validity is not appropriate 


in the analysis of problem solving be- 
havior, since V is based on the function 
of x. It is important to note that r may 
be influenced by the differential dis- 
tributions of MF. Bourne and Restle 
(1959) have provided evidence that the 
discrimination model can account for 
the results of some concept identification 
experiments, although the present study 
provides the first test of the cue validity 
formulation. Present results do not 
suggest any simple modification of this 
concept. 
SUMMARY 


The primary aim of this experiment was to 
investigate the effects of misinformative 
feedback (MF) and task complexity upon 
concept identification. Each of the 180 Ss 
was given 12 blocks of 16 patterns, a total 
of 192 stimuli. The degrees of MF were 0, 
10, 20, 30, and 40% and of complexity, 1, 3, 
and 5 irrelevant dimensions. There were 
three problems, each with a different relevant 
dimension, and equal numbers of men and 
women Ss were included in each treatment 
combination. Essentially, S’s task was to 
learn the correct method for classifying 
visually presented geometric patterns. 

The major findings were: (a) Performance 
improved significantly over trials. (b) MF 
was a significant source of variance in the 
analysis. The asymptotic mean error scores 
for the 10, 20, and 30% MF conditions did not 
significantly differ from the mean predicted 
values based on a _ probability matching 
hypothesis. (c) The relationship between 
errors and Complexity was shown to be 
linear. Analysis of variance produced a 
significant MF X Complexity interaction, 
suggesting an increase in the effect of MF 
at higher levels of Complexity. (d) Sex and 
Problems effects had no significant influence 
on performance. (e) The relationship be- 
tween the reciprocal of the estimated propor- 
tion of relevant and the number of 
irrelevant dimensions was linear in accordance 
with Restle’s (1955, 1957) learning model. 
(f) Number of errors as predicted by this 
model was greatly divergent from the ob- 
tained errors for all the combined MF XK Com- 
plexity conditions. 


cues 
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EYELID CONDITIONING PERFORMANCE UNDER PARTIAL 
REINFORCEMENT AS A FUNCTION OF UCS INTENSITY! 


LEONARD E. 
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Investigations of the relationship 
of UCS intensity to performance in 
eyelid conditioning have consistently 
employed (100%) _rein- 
forcement. Thus, while studies have 
demonstrated 1958) that 
performance is an increasing function 
of the intensity of the UCS, evidence 
is not available concerning the rela- 
tionship of this variable to eyelid 
conditioning performance under par- 
tial schedules. _Al- 
though there are a number of eyelid 
conditioning studies in the literature 
that have used 50% partial reinforce- 
ment (Grant & Hake, 1951; Grant & 
Schipper, 1952; Grant, Schipper, & 
Ross, 1952; Gynther, 1957; Ross, 
1959), a comparison of the perform- 
ance levels obtained in these experi- 
ments as a function of the UCS 
strength employed is not feasible 
since in some studies the puff strength 
was unspecified and, in addition, 
experimental conditions, which might 
in themselves account for performance 
differences, have varied from study 
to study. 

Under 100% reinforcement 
tions the relationship between condi- 
tioning performance 
CRs, Trials 41-80) and the intensity 
of the UCS has been found to be a 


continuous 


(Spence, 


reinforcement 


condi- 


negatively accelerated increasing func- 
tion (Spence, 1958). The question 


! This study is part of a project concerned 
with the influence of motivation on perform- 
ance in learning under Contract N9 onr-93802, 
Project NR 154-107 between the State Uni- 
versity of lowa and the Office of Naval 
Research. Robert S. Blanchard helped in the 
collecting of the data. 

2 Now at the University of Wisconsin. 


(percentage of 


arises whether or not the relationship 
under partial reinforcement conditions 
is of the same general form. The 
previously cited studies which em- 
ployed partial reinforcement uniformly 
found higher levels of conditioning 
with 100% reinforcement than with 
50% partial reinforcement. Since 
this continuous reinforcement superi- 
ority is maintained when comparisons 
are made in terms of reinforced trials 
rather than trials alone, the lower 
level of performance of the partial 
groups is presumably due to the 
decremental effects of nonreinforced 
trials. It has been suggested (Ross, 
1959) that these effects are, in part 
at least, due to factors that might 
be classified as “set” variables (i.e., 
under some degree of voluntary 
control). 

Whatever the nature of the factors 
producing the decrement it seems 
quite possible that the introduction 
of nonreinforced trials might result 
in a relationship between conditioning 
level and UCS intensity different 
from that obtained with continuous 
reinforcement. In the present experi- 
ment groups of Ss under a random 
50% reinforcement schedule were 
conditioned with different intensities 
of the UCS in an effort to provide 
information concerning the function 
relating conditioning performance 
under partial reinforcement conditions 
to the strength of the UCS. Inaddition, 
one group was conditioned with contin- 
uous reinforcement to supplement ex- 
isting 100% data so that 50%-100% 
comparisons could be made at an addi- 
tional UCS intensity value. 
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METHOD 


Subjects —The Ss were 115 women and 58 
men from an introductory psychology course. 
Of these, 10 men and 6 women were dis- 
carded for exceeding the criterion used to 
exclude voluntary responders (Spence & 
Ross, 1959), while 1 woman was excluded 
from the data for giving CRs to test trial 
presentations of the CS alone. Six additional 
Ss were discarded, three due to E error and 
three due to apparatus failure. The remain- 
ing 150 Ss were assigned at random to five 
groups except that the proportion of women 
to men was fixed at a ratio of 7 to 3. 

Apparatus and method of recording.—The 
apparatus and method of recording which 
involved a potentiometer—polygraph system, 
has been described elsewhere (Spence & 
Taylor, 1951). The CS was an increase in 
brightness of a 2}-in. diameter circular milk 
glass disk from less than .004 apparent ft-c 
to .32 apparent ft-c. The UCS was a 50- 
msec. air puff delivered through a .062-in. 
diameter orifice to the right eye. 

Conditioning procedure.—Each S was given 
instructions to blink once to the ready signal 
and then to look at the disk on the wall, 
remaining as relaxed as possible, until the 
light went out. Following the reading of the 
instructions three test trial presentations of 
the CS alone and one presentation of the UCS 
alone were given. Immediately following this 
preliminary procedure each S received 140 
trials at intertrial intervals of 15, 20, and 
averaging 20 and arranged 
according to a fixed schedule. The word 
“ready” preceded each presentation of the 
CS by 2, 3, or 4 sec. according to a prear- 
ranged schedule. 

Four groups of 30 Ss were conditioned with 
a 50% reinforcement schedule consisting of 
half reinforced and half nonreinforced trials, 
presented in a prearranged sequence within 
which neither type of trial occurred more than 
twice in succession. A rei. forced trial in- 
volved a 500-msec. CS-UCS interval while 
on nonreinforced trials the UCS followed the 
CS onset at 2500 msec. Each group was 
conditioned with a different intensity of UCS. 
The four values used were .33, .6, 1.0, and 
4.0 psi. An additional group of 30 Ss was 
given 100 trials with continuous reinforce- 
ment. The UCS for this group was a .33 
psi air puff. 
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Frequency curves of conditioning 
by blocks of 10 reinforced trials for 
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BLOCKS OF TEN REINFORCED TRIALS 


Fic. 1. Mean percentage of CRs in successive 


blocks of 10 reinforced trials. 


the four groups receiving partial 
reinforcement are shown in Fig. 1. 
Inspection of the figure reveals that 
only the 4.0-lb. psi group showed 
an appreciable performance increase, 
the other three groups remaining 
under the 20% CR level throughout 
the course of conditioning. Although 
the 1.0-lb. group was superior to the 
.33-lb. group the performance level of 
the .6-lb. group was slightly lower than 
that of the .33-lb. group. 

To statistically evaluate these dif- 
ferences a Kruskal-Wallis (1952) H 
test was employed with the scores 
for the four groups over conditioning 
Trials 51-70 (Trials 100-140). The 
resulting HT of 20.64 was significant 
at the .001 level. Subsequently, 
Mann-Whitney U tests were used 
to test the observed differences for 


TABLE 1 


MANN-WHITNEY U AND THE CORRESPONDING 
P VALUES FOR ALL PossIBLE Com- 
PARISONS OF THE GROUPS ON 
TRIALS 41-70 


Group Comparisons 


.33-— .6 
.33-1.0 
.33-4.0 
.6 -1.0 
.6 -4.0 
1.0—- 4.0 








EYELID CONDITIONING 


all possible pairs of puff strength 
values. Table 1 presents the U and 
corresponding P value for each of the 
It should be noted that 
while the 4.0-lb. group differed sig- 
nificantly from each of the other 
groups, none of the other comparisons 
reached the .05 level of significance. 
In view of statistical results 
the differences among the three low 
puff strength groups must be 
sidered to be fortuitous. It would 


comparisons. 


these 


con- 


appear that with UCS values of 1.0 
Ib. and less a low performance level 


is obtained and that within this range 
there is little, if any, positive relation- 
ship between. UCS 
conditioning performance. 

In order to 


intensity and 
function 
relating performance to 
UCS 50% partial and 
continuous reinforcement conditions, the 


compare the 
conditioning 
intensity for 


data of a number of published studies 
from the 
Except for the 
these 


used. 
UCS 
comparable 


lowa laboratory 
intensity of the 
highly 
to the present study in terms of appara- 
tus, conditioning procedure, and experi- 
mental conditions (i.e., CS, CS-UCS 
interval, intertrial interval, etc.). 


were 


studies were 


Since 
the use of a frequency measure imposes 
a ceiling on performance (100%), and a 
number of Ss in the strong puff groups 
had reached this ceiling, the comparisons 
were made in terms of the conditioning 
performance of homogeneous Ss, i.e., 
the middle 40% of each group. Thus 
the Ss from each group were rank 
ordered on the basis of the total number 
of responses made during the first 70 
conditioning trials and the mean per- 
centage of CRs made by the middle 40% 
of each distribution computed. 
Because of varying ratios of men to 
women this procedure followed 
for each sex separately and the mean 
percentage values obtained 
aged for each experiment. 


was 

was 
were aver- 
The results 
obtained are presented in Fig. 2. For 
reinforcement 
the .33-lb. point taken 
100% reinforcement group in the present 


the continuous function 


was from the 
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100% REINFORCEMENT 


50% REINFORCEMENT 


PER CENT CR'S,, REINFORCED TRIALS 41-70 





UCS INTENSITY (PS1.) 

FiG 2. Relationship between percentage 
of CRs and UCS intensity for groups of 
homogeneous Ss (see text) receiving partial 
and continuous reinforcement. 


experiment while the other points, from 
.6 to 4.0 Ib., respectively, were obtained 
from studies by Runquist and Ross 
(1958), Runquist and Spence (1959), 
Ross (1959), and Spence and 
(1959). All of the points on the partial 
reinforcement function were taken from 
the present experiment except for the 
2.0-lb. point which was obtained from 
an earlier study (Ross, 1959). 

From an inspection of Fig. 2 
evident that the shapes of the functions 
are quite different for the two reinforce- 
ment schedules. With continuous rein- 
forcement the function is negatively 
accelerated with small increases in puff 
strength at low absolute levels resulting 
in large increases in performance. In 
contrast, under 50% reinforcement the 
function is apparently s-shaped with 
little performance change over the low 
range of UCS intensity values. 

Just why such different results should 
be obtained with the introduction of 
nonreinforced trials into the conditioning 
situation is not However, it is 
possible that, if the decrement found 
with 50% reinforcement is to 
degree related to ‘‘set’’ factors, inhibi- 
tion of performance is more readily 
accomplished under conditions of low 
puff strengths. Thus, increases in puff 
strengths over this range would be inef 
fective in increasing performance. The 
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clear. 


some 
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large differences between the 100% and 
50% reinforcement groups at high levels 
of puff strength require that considerable 
“inhibition” still be present with such 
puffs. Stronger puffs, then, would 
increase performance but not completely 
overcome the “inhibition” effects of 50% 
reinforcement conditions. Further clari- 
fication of the differences between the 
functions would appear to be dependent 
upon added information concerning the 
role of inhibitory factors in partial 
reinforcement. 


SUMMARY 


Groups of Ss under a random 50% rein- 
forcement schedule were conditioned with 
different intensities of UCS (.33, .6, 1.0, 4.0 
psi) in an effort to provide information con- 
cerning the function relating conditioning 
performance under partial reinforcement con- 
ditions to the strength of the UCS. At UCS 
values of 1.0 lb. and less, a low performance 
level was obtained, with little indication of a 
positive relationship between performance 
and UCS intensity. A 4.0-lb. UCS resulted 
in performance significantly higher than that 
of the other groups. 

In order to compare the function relating 
conditioning to UCS strength obtained for 
partial reinforcement in the present experi- 
ment with one for continuous reinforcement, 
the data from a number of previously pub- 
lished studies were analyzed and a similar 
function determined for 100% reinforcement. 
The functions differed in shape, the partial 
reinforcement function being S-shaped, while 
that for continuous reinforcement appeared 
to be negatively accelerated throughout. 
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EFFECT OF TRIPLET AND QUADRUPLICATE LOCATION 
IN VERBAL MAZE PATTERNS UPON SERIAL 
POSITION ERRORS! 


GEDIMINAS NAMIKAS, CHARLES P. THOMPSON, ano W. J. BROGDEN 


University of Wisconsin 


The effect of change in pattern of 
the 16-unit, 4-choice verbal maze of 
Brogden and Schmidt (1954) upon 
its acquisition and form of the serial 
position error curve has been studied 
by Thompson, Voss, and Brogden 
(1957) and by Ernst, Hoffeld, Seiden- 
stein, and Brogden (1960). Altera- 
tion of the standard maze pattern 
by the introduction of a doublet (two 
like items) or a 


successive response 


split-doublet (two identical response 


different re- 
sponse item) changes the form of the 


items separated by a 


serial position error curve but does 
not affect speed of acquisition. <A 
doublet in each of five different loca- 
tions of the maze pattern produces an 
increase in error at the position of the 
second item. A split-doublet in each 
of four different locations (except 
for Positions 2-4) produces an in- 
crease in error at the third position 
(second like item). With the one 
exception for the split-doublet, there 
is no differential effect of doublet or 
split-doublet the 
pattern. 

The doublet effect is maximum for 
the total error measure early in 
acquisition and decreases in magni- 
tude thereafter. For 
measure, there is no 
over all ‘trials. For the first one- 
quarter of the doublet effect 
is present as an increase of error at 


location in maze 


the first error 
doublet effect 


trials 


1 This research was supported in part by 
grants from the National Science Foundation 
and the Research Committee of the Graduate 
School from funds provided by the Wisconsin 
Alumni Research Foundation. 


the position of the second item, but 
for the last three-quarters of trials 
the effect is reversed and occurs as a 
decrease in error at the second position 
of the doublet. The presence of a 
doublet in the maze pattern thus has 
an inhibitory effect upon acquisition 
during the discovery phase and this 
is a function primarily of the guessing 
habits of Ss, particularly the tendency 
not to repeat a response correct at 
the previous position. Once dis- 
covery is completed, the identity of 
the doublet items facilitates further 
acquisition of the second item of the 
doublet. 

The split-doublet phenomenon is 
apparently different from that for the 
doublet. There is no evidence of 
difference between the total error 
and first error measures over all 
trials or for stages of acquisition, nor 
of a relation between gtiessing habits 
of Ss and the split-doublet effect. 
These results suggest that there may 
be no consistent effect of a pattern 
dimension upon the serial position 
error curve and that variables repre- 
sentative of the dimension such as 
the doublet and split-doublet have spe- 
cific and independent effects. With- 
in the parameters of the 16-unit, 4 
choice verbal maze pattern, with each 
choice represented four times, the 
triplet (three successive like responses) 
and the quadruplicate (four successive 
like responses) are the only other 
sources of pattern variation that 
can be included in a dimension that 
also includes the doublet and split- 
doublet. The effects of the triplet 
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and quadruplicate upon the form 
of the serial position error curve were 
hypothesized to be different from 
those of the doublet and split-doublet, 
to be different from each other, to 
involve both increases and decreases 
in error as a function of position 
within the triplet or quadruplicate 
sequence, and to show no difference 
in character as a function of location 
within the maze pattern. It was 
hypothesized further that the intro- 
duction of a triplet or quadruplicate 
would increase the speed of acquisi- 
tion over that for the standard maze 
pattern. This paper presents the 
results of two experiments, one in 
which the triplet and its location 
represent the independent variable 
and the other for which the independ- 
ent variable is the quadruplicate and 
its location in the maze pattern.’ 
These two experiments follow closely 
the design of the earlier studies 
on the doublet and split-doublet in 
an attempt to test the hypotheses 
presented above. 


PROCEDURE 


Experiment I.—The four experimental 
groups acquired a modification of the stand- 
ard maze pattern which includes a single 
triplet located at Maze Units 3, 4, and 5, 
at 6, 7, and 8, at 9, 10, and 11, and at 12, 13, 
and 14, respectively. Each modified pattern 
was altered not only to locate the triplet at 
the appropriate positions, but also to exclude 
doublet, split-doublet, or other noticeable 
sequences in the maze. Each pattern was 
permuted to obtain four subpatterns and 
thus provide for each number (10, 20, 30, or 
40) to be represented once at each of the 16 
positions. Data for the control pattern is 
that reported by Ernst et al. (1960). 

Of 20 Ss in each of the four experimental 
groups, five learned each one of the four 
subpatterns. All of the Ss were treated by a 
single EZ. The Ss were volunteer students 


from the classes in elementary psychology 


2? Thompson collaborated with the senior 
author on the triplet experiment and Namikas 
on the experiment with the quadruplicate. 
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at the University of Wisconsin. Assignment 
of Ss to experimental conditions was random 
within each replication. 

The instructions of Thompson et al. (1957) 
were read to S. Responses of 10, 20, 30, or 
40 were made by S with E signaling the 
correct response by saying “Forward.” 
This procedure continued until S completed 
one errorless trial. Each response at each 
unit was recorded by E for each trial. A 
stop watch was used to measure the time of 
each trial and the 1-min. rest between trials. 
Other conditions were comparable to those 
of Thompson et al. (1957). 

Experiment IIT.—The patterns learned by 
the three experimental groups contained a 
single quadruplicate located at Maze Units 
3, 4, 5, and 6, at 7, 8, 9, and 10, and at 11, 12, 
13, and 14, respectively. Each pattern repre- 
sents a limited modification of the standard 
pattern to provide a quadruplicate at the 
desired locus without other noticeable pattern 
sequences elsewhere in the maze. In all other 
respects, the procedure for Exp. II was 
identical to that of Exp. I. 


RESULTS 


Experiment I.—Analyses of vari- 
ance of the data for time and trials 
for the four experimental and one 
control group showed no significant 
effect for maze pattern, maze pat- 
tern subgroups, or the interaction 
of these two variables. the 
assumption of homogeneity of vari- 
ance was found not tenable for the 
error data, the 1% level of significance 
was selected in lieu of the 5% level 
for all analyses. No source from the 
between-Ss variation was significant. 
Further analysis of the significant 


Since 


interaction of serial position by maze 
pattern was based upon several con- 
versions of the data. 

Total error to the criterional trial at 
each serial position was converted to 
percentage of total error for each S 
in each group (McCrary & Hunter, 
1953). The group mean of the 
percentage scores for Ss of each 
experimental group and the com- 
bined control group of Ernst et al. 
(1960) are presented in Fig. 1 by 
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Fic. 1. Serial position curves in terms 
of percentage of total errors and percentage 
of first errors as a function of triplet locus 


the solid line curves. There are 
substantial differences in the form 
of the curves for the experimental 
groups compared with each other and 
with the control group. These differ- 
ences are clearly shown by the solid 
line curves in the left half of Fig. 2. 
The data for these curves were ob- 
tained by taking the difference in 
percentage of error for each S and the 
mean percentage error for the com- 
bined control group at each of the 16 
serial positions and computing the 
mean of differences for each 
of the four experimental groups. For 
each group there is an increase in 
error for the third position of the 
triplet, and except for Group E,, 
there is a decrease in error for the 
first position. Analysis of variance 
of these data shows in each case a 
significant interaction of serial posi- 
tion and maze pattern and thus 
establishes significant differences in 
the form of the difference curves 
between the experimental groups and 
the control group and between the 
four experimental groups. 


these 
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The broken line curves of Fig. 1 
are the group mean percentage of 
first errors over all trials to the 
criteria and the broken line curves 
of Fig. 2 are the difference curves 
between each experimental group 
and the control group. The curves 
in the right half of Fig. 2 give the 
difference between percentage total 
error and percentage first error for 
the experimental groups. 
substantial differences 
all error and first 
occur at Positions 2 and 3 of the 
triplet. Analyses of variance of the 
two sets of first error data and of the 
differences between percentage total 
error and percentage first error show 
in each significant serial 
position by maze pattern interaction. 

The ¢ test was used to establish 
fiducial limits at the 1% level of 
significance for the three sets of 
mean difference data shown in Fig. 2. 


The only 
between the 
error measures 
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Fic. 2. Curves of differences between serial 
position curves for experimental minus control 
groups for percentage of total and first errors 
(left side) and curves of differences between 
these differences for each experimental group 
(right side). 
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These limits are +3.1 for all errors, 
+2.9 for first errors, and +1.6 for the 
difference between all errors and 
first errors. For the all error measure, 
the differences from the control group 
that equal or exceed the fiducial limits 
for triplet locus are the first triplet 
item (negative) for Group E;, and 
the third triplet item (positive) for 
Group E;. No positions outside of 
the triplet locus exceed the limits. 

For the first error measure, the first 
item of the triplet exceeds the negative 
limit for Groups E, and E3, and the 
second item of the triplet exceeds 
the negative limit for Groups E» and 
E,. Other significant differences are 
Positions 10 (positive) for E,, 14 
(positive) for Es, 4 (positive) for 
E;, and 10 (positive) for E,. 

For the difference between total 
error and first error, the second and 
third items of the triplet exceed the 
positive fiducial limit for every group. 
In addition, Position 10 (negative) 
exceeds the limits for E;, 10 (negative) 


and 14 (positive) for Es, 7 (negative) 
for E3, and 7 and 10 (negative) for 
Fy. 

The triplet effect is not entirely 
consistent for all groups and is of 
different character for the total and 


first error measures. It appears to be 
one of increasing total error over the 
three positions, starting with less 
error than that for the control pat- 
tern at the first position and terminat- 
ing with greater error. The means 
over all groups are —1.8, 0.0, and 2.1, 
respectively, for the three positions. 
For the first error measure, the effect 
seems to be a general reduction in 
error from that of the control group 
at all positions, that increases from 
the first to second item and decreases 
from the second to the third. The 
means over all groups are —1.5, 
— 2.6, and —0.8, respectively. These 
apparent trends were tested by or- 
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thoganol polynomial analysis of vari- 
ance (Grant, 1956) on the data at the 
triplet positions over all experimental 
groups for total error and for first 
errors. A significant linear trend was 
found for the total error measure 
and a significant quadratic trend 
occurred for the first error measure. 

Since Ernst et al. (1960) found the 
difference in doublet effect for the 
total and first error measures to be a 
function of stage of acquisition, the 
frequency of total and first error at 
the triplet locations were tabulated 
by quarters of trials to the criterional 
trial for the experimental groups and 
the control group. The E-CC dif- 
ferences were obtained for each S 
of the experimental groups and the 
over-all group means for total and 
first error measures computed for 
each triplet position. All differences 
for both measures are positive for 
the first quarter of trials and negative 
for each one of the last three quarters 
of trials except for the total error 
measure for the third triplet item 
for the second quarter. The linear 
trend for total errors and the quad- 
ratic trend for first errors do not 
occur consistently for each quarter 
of trials, but for both measures the 
effect appears to be maximum for 
the first quarter and to decline 
progressively over the remaining three 
quarters. Each measure shows its 
characteristic trend, but early in 
learning the trend occurs over errors 
greater than those for the control 
group and later in acquisition, over 
errors less than the control group. 
No statistical tests were made on 
these data. 

Experiment II.—Analyses of vari- 
ance of the data for time and trials 
for the three experimental and one 
control group showed no significant 
effect for maze pattern, maze pattern 
subgroups or the interaction. Since 
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the assumption of homogeneity of 
variance was untenable for the error 
data, the 1% level of significance 
was selected in lieu of the 5% level 
for all analyses. Treatment of the 
error data for this experiment follows 
that described already for Exp. I, 
with comparable results. The inter- 
action of Serial position X Maze pat- 
tern is significant for total errors, 
for percentage total errors, for per- 
centage first errors, for percentage 
total errors experimental minus con- 
trol, and for percentage first errors 
experimental minus control. The 
serial position error curves for per- 
centage of all error and for percentage 
of first error for the control group 
and the three experimental groups 
are presented in Fig. 3. The effect of 
the quadruplicate appears to be one 


of general reduction of error. This 
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is shown by the curves of difference 
between the experimental and control 
groups for percentage total errors and 
percentage first errors which are 
presented in the left half of Fig. 4. 
The curves in the right half of Fig. 4 
are the differences between percent- 
age total errors and percentage first 
errors for each experimental group. 
The curves for all error and for first 
error follow each other closely except 
that the first error curves for E; and 
E» appear to show a greater reduction 
in error for the quadruplicate locus 
than do those for total error. The 
curves of difference between total 
and first errors appear to reflect these 
differences primarily. Analysis of 
variance of the difference data be- 
tween total and first errors reveals 
a significant interaction of 
position and maze pattern. 
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Serial position curves in terms of percentage of total errors and percentage of 


first errors as a function of quadruplicate locus. 
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Fic. 4. Curves of differences between serial position curves for experimental minus control 
groups for percentage of total and first errors (left side) and curves of differences between these 
differences for each experimental group (right side). 


The fiducial limits at the 1% level 
of significance are +3.6 for all errors, 
+2.0 for first errors, and +2.6 for the 
difference between all errors and first 
errors. For the total error measure, 
the differences from the control group 
that equal or exceed the negative 
limit at the quadruplicate locus are 
Item 1 for Es, and Items 1 and 3 
for E;. The only other position that 
exceeds the limits is 14 (positive) 
for Ej. 

For the first error measure, quadru- 
plicate Items 2, 3, and 4 for Ey, 
1, 2, 3, and 4 for Es, and 1, 2, and 
3 for E; equal or exceed the negative 
limit. Other positions that exceed 
the limits are 7 (negative), 14, and 
16 (positive) for E,, 3, 4, 5, and 12 
(positive) for Ex, and 5 and 10 (posi- 
tive) for E;. The fiducial limits for 
the curves of difference between all 


errors and first errors are exceeded 
by Item 3 (positive) of the quadru- 
plicate for E, and by Position 15 
(positive) for the same group. 

Orthogonal polynomial analyses of 
variance (Grant, 1956) were com- 
pleted on the E-CC data of the 
quadruplicate positions over all ex- 
perimental groups for total errors 
and for first errors. The over-all 
group means are —3.3, —2.1, —2.3, 
and 0.1, respectively, for total errors, 
and —1.0, —0.7, —0.5 and 0.5, 
respectively, for first errors. Sig- 
nificant linear, quadratic, and cubic 
components were obtained for the 
total error data, and significant quad- 
ratic and cubic components were 
obtained for the first error data. 

The E-CC data for total and first 
errors at the positions of the quadru- 
plicate were each tabulated by quar- 
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ters of trials to the criterial trial. All 
means over groups show less error 
than the control group except the 
last three positions of the quadrupli- 
cate for the first quarter of trials for 
the total error measure. These latter 
means are progressively positive in 
magnitude. The trend over the four 
positions of the quadruplicate is more 
apparent during the first two quarters 
of trials for both measures and is 
more evident with the total error 
measure than with the first ‘error 
measure. No statistical tests were 
made of these data. 


DISCUSSION 


The two experiments of this study 
establish a triplet effect and a quadrupli- 
cate effect on the form of the serial 
position error curve. Locus of the 


triplet and the quadruplicate does not 
appear to be differential, except that 
location of the pattern early in the maze 
(3, 4, 5 for the triplet and 3, 4, 5, 6 for 
the quadruplicate) produces a less clear- 


cut effect than does location of either 
pattern elsewhere. The general form 
of each effect is different for the total 
error and first error measures. For 
the triplet, there is a linear trend for 
total error, starting with a decrease 
in error relative to the control pattern 
for the first item, no difference for the 
second item, and an increase in error 
for the final item. There is a quadratic 
trend for the first error measure, with 
less error than the control pattern for 
each of the triplet items. For the 
quadruplicate, both the total error and 
first error measures show less error than 
the control pattern for the first three 
positions and a zero or small positive 
difference for the fourth item. The 
general trend for the total error measure 
has linear, quadratic, and cubic com- 
ponents and that for the first error 
measure has quadratic and cubic com- 
ponents. For both the triplet and quad- 
ruplicate, there are other differences in 
form of the serial position error curve 
that may not be accounted for by the 


389 


location of either pattern or by the 
interchanges of response items necessary 
to pattern formation. 

The triplet and quadruplicate effects 
are not only of different character from 
each other but differ also from the 
doublet and the split-doublet effects 
(Ernst et al., 1960). The doublet effect 
stands apart from the other three pattern 
effects in that it is in part a function of 
the guessing habits of Ss and the effect 
measured by first errors is diphasic. 
Although the quadruplicate effect as 
measured by total errors shifts from 
positive error relative to the control 
group for the first quarter of trials to 
negative for each of the last quarters, 
the trend over both triplet and quad- 
ruplicate positions is relatively stable 
over the course of acquisition. The 
differences in the triplet and quadrupli- 
cate effects .between total and first 
errors appear to be a function of the dif- 
ferences in the nature of these measures. 

It does appear that as the number 
of like items is increased, there is a 
progressive decrease in error at the 
locus involved, going from an initial 
increase in error relative to the control 
pattern for the doublet to a decrease in 
error maximum for the quadruplicate. 
All show a trend within the pattern 
of a relative increase in error from the 
initial to succeeding items. Each repre- 
sents a characteristic modification in the 
form of the serial position error curve 
without affecting the over-all measures of 
acquisition. 

The quadruplicate represents the up- 
per limit of a pattern dimension involving 
succeeding like items with fixed param- 
eters of a 16-unit maze of four choices, 
each of which occurs as the correct 
response four times. Longer mazes with 
a greater number of choices would permit 
extension of the variable of number of 
succeeding like items and should produce 
larger decrements in error relative to 
the control pattern than that produced 
by the quadruplicate. It is likely also 
that the pattern effects beyond the 
quadruplicate in longer mazes will pro- 
duce an over-all decrease in measures 
of acquisition relative to the control 
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pattern and differential effects as a 
function of locus. The results from the 
triplet and quadruplicate studies suggest 
that their effects are reduced for the 


early positions of the maze, as is the 
case also for the split-doublet. 


SUMMARY 


Two experiments were designed to test 
respectively the effect of location of a triplet 
and a quadruplicate on the form of the serial 
position error curve from the acquisition of 
a 4-choice, 16-unit verbal maze. Significant 
differences were obtained in both experiments 
in the form of the curves between each experi- 
mental group and the control group, and 
between experimental groups. The effect 
was less clear for the locus early in the pattern 
(3, 4, 5 for triplet and 3, 4, 5, 6 for quadrupli- 
cate) than for the other locations. The 
triplet effect is a linear trend over the three 
positions for the total error measure, with 
over-all mean differences relative to the 
control group of —1.8, 0.0, and 2.1. For 
the first error measure, the respective means 
are —1.5, —2.6, and —0.8, with a significant 
quadratic trend. The quadruplicate effect 
is represented by mean differences over the 
four positions of —3.3, —2.1, —2.3, and 0.1 
for total errors and —1.0, —0.7, —0.5, and 
0.5 for first errors. For the former, there 
are linear, quadratic, and cubic components, 
and for the latter, quadratic and cubic com- 
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ponents. No relation was found between 
these phenomena and the guessing habits 
of Ss. The phenomena are relatively stable 
over successive stages of acquisition, and 
there are no marked differences between 
total and first errors at the different stages 
of acquisition. The results are discussed 
relative to the doublet and _ split-doublet 
effects and in relation to a general pattern 
dimension and its effect upon the serial 
position error curve. 
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EFFECT OF CONCURRENT RESPONSES ON THE 
EVOCATION AND GENERALIZATION OF 
THE CONDITIONED EYEBLINK! 


G. ROBERT GRICE? ann JOHN D. DAVIS* 


University of Illinois 


Recently, the authors (Grice & 
Davis, 1958) have presented evidence 
for a mediated generalization effect 
in human eyelid conditioning. The 
experiments involved a differential 
conditioning paradigm in which the 
middle of three tones served as the 
CS. The made concurrent 
manual reaction-time responses to 
the three tones. It was found that 
significantly more CRs were made 
to the negative stimulus to which 
the manual response was the same 
as the CS than to the one to which 
the manual was different. 
This asymmetrical generalization gra- 


Ss also 


response 


dient was interpreted as being in 
accord with a theory of mediated 
stimulus generalization, and the ad- 
vantage was claimed for the design 


that the “mediating reactions’ were 
overt and subject to experimental 
control. The results of these experi- 
ments were seriously complicated, 
however, by a direct facilitative 
effect of the manual reactions on the 
eyeblink response. This effect ap- 
peared to be more powerful than the 
mediational one and introduced some 
danger in the interpretation of the 
experiments. For example, if the 
manual reactions to the “Same Re- 
sponse’”’ tone should be faster than 
to the ‘Different Response’ tone 
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the asymmetrical generalization could 
be an artifactual result of differential 
facilitation, rather than an associa- 
tive, mediational effect. 

The first purpose of the present 
experiments is to investigate the 
relation between these two effects by 
means of an analysis of the relation 
between the latencies of the manual 
reactions and the occurrence of eye- 
blinks. The second purpose is to 
compare the mediational effect in a 
situation where the CS and the gen- 
eralized stimuli are in the same sense 
modality with one in which they are 
different. Theoretically, while pri- 
mary stimulus generalization would 
be substantially greater within one 
modality, the amount of mediated 
generalization should be independent 
of the fact that the CS and general- 
ized stimuli are in different sensory 
dimensions. 


METHOD 
A pparatus 


The experimental situation was the same 
as previously described (Grice & Davis, 
1958) with the exception of certain improve- 
ments in instrumentation. The tones were 
presented by means of Permoflux PDR- 
8 earphones rather than by means of a loud- 
speaker. The same three tones, 240, 850, 
and 1900 cps, were presented at 70 db loudness 
level with reference to an average threshold 
for 1000 cps. They were switched on and off 
by means of a Grayson-Stadler electronic 
switch, which was set with a 30 msec. rise 
and decay time. A continuous “white” 
masking noise of 20 db sensation Jevel was 
also presented. The UCS was a puff of air 
of 1 lb./sq. in. pressure, and the CS-UCS 
interval was .5 sec. Programming equipment 
made possible the automatic presentation of 
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the entire experimental session of 100 trials, 
with attendant reduction of E error and 
general increase in precision. 

The manual reaction latencies were re- 
corded on the oscillograph tape with the CRs 
and were read to the nearest 8 msec. Reac- 
tions in excess of 640 msec. were recorded 
at that value. 


Procedure 


Experiment I.—With the exception of the 
apparatus differences noted, the procedure 
was the same as in Exp. | of the previous 
paper. The same counterbalancing of the 
high and low tones and right and left hand 
responses was employed. The middle tone 
always served as the CS, the air puff never 
being paired with either of the other tones. 
Criteria for scoring CRs and elimination of 
voluntary responders were the same, although 
no voluntary responders were actually 
eliminated in either experiment. 

The Ss were 12 men and 12 women stu- 
dents who served as part of a requirement 
for a course in general psychology. Eight 
additional Ss who made fewer than 12 CRs 
were eliminated. The number of errors in 
the reaction-time task ranged from 0 to 6 
with a mean of 2.0 and a median of 1.85. 

Experiment II.—Procedure was the same 
as in Exp. I, except that the CS was a frosted 
7-w. bulb placed ? in. behind the 1-in. milk 
glass disk used as a fixation point in the 
previous experiments. The negative stimuli 
were the same 240- and 1900-cps tones used 
in Exp. I. 

The Ss were 12 men and 12 women from 
the same population as Exp. I. Three Ss 
who made fewer than 12 CRs were eliminated. 
Errors on the reaction-time task ranged from 
0 to 5, with a mean of 2.21 and a median 
of 2.0. 
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RESULTS 


All data presented are for Trials 
41-100, which included 20 presenta- 
tions of each of the three stimuli. 
The mean number of CRs to each 
of the three stimuli in the two experi- 
ments are presented in Table 1. 
The predicted asymmetry was ob- 
tained in both experiments, there 
being more responses to the negative 
stimulus to which the manual reac- 
tion was the same as to the CS than 
to the one for which it was different. 
Experiment I replicates the results 
of Exp. I in the previous paper except 
that the above-100% generalization 
to the Same Response stimulus found 
in that experiment was not obtained. 

Mean latencies of the manual 
responses are presented in Table 2. 
For Exp. I, the reaction times were 
somewhat faster to the Same Re- 
sponse stimulus. There was little 
difference between the reaction times 
to these two stimuli in Exp. II. 

In order to more fully evaluate the 
effects of the mediational and facilita- 
tive effects, point biserial correlation 
coefficients were computed over trials 
between reaction time and the oc- 
currence or nonoccurrence of CRs 
for each S separately. The means 
of these correlations, based on the z 
transformation, are presented in Table 
2. The number of CRs to each stimu- 


TABLE 1 


MEANS OF UNADJUSTED AND ADJUSTED NUMBER OF CRs 





Experiment I 


Stimulus Unadjusted 


} Ss Mean 


Same Resp. 
CS 


Diff. Resp. 





| Experiment II 


Adjusted Unadjusted Adjusted 


Mean | 
8.33 

16.00 
5.96 
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TABLE 2 


REACTION-TIMI 


MEANS AND AVERAGE Point BISERIAL CORRELATIONS BETWEEN 


REACTION TIME AND CR OCCURRENCE 


Reaction Time in Msec. 


Stimulus Exp. I Exp. Il 


Mean* SD 


Same Resp. | 508 73 


CS 
Diff. Resp. 


526 61 


* Mean of S means. 
» Obtained by ¢ transformation. 


lus was then adjusted for each S 
separately by means of his own “‘with- 
in stimulus” regression coefficient. 
This leaves the S totals unchanged, 
but adjusts the number of CRs to 
each stimulus for each S for reaction 
time, and, presumably, for within-S 
variation in the facilitative effect. 
Means of the adjusted measures 
are presented in Table 1. It may 
be noted that the asymmetry of 
generalization was preserved for both 
experiments. 

Separate analyses of variance of the 
adjusted numbers of CRs for the two 


TABLE 3 


ANALYSES OF VARIANCE OF ADJUSTED 
NUMBER OF CONDITIONED RESPONSI 


Source 
Exp. I Exp. II 

Between Ss 
Tone arrangement (T) .0005 12 
Hand arrangement (H) | 35 88 
T XH 4.84 .002 
Error (b) (37.85)* | (62.52)* 

Within Ss 
Generalization (G) 
G XT 


5.50* 
4.25* 
97 
04 
(7.38)* 


48.03** 


Error (w) 


Total 


* Numbers in parentheses are error mean squares. 
*P < .025. 
*P < .001. 


Mean* SD 


500 59 
509 78 454 36 
503 64 


Exp. I Exp. II 


Mean? Range Mean? Range 


— .63 
—.59 
— 46 


+.30to —.89)| —.! 
—.08 to —.81|) —. 
+ .04 to —.86|) —.2 


experiments are presented in Table 3. 
While the Generalization effect is 
significant for both experiments, the 
appropriate test for the predicted 
asymmetry of the mediation hypoth- 
esis is a ¢ test, based on the within-Ss 
error, of the differences between the 
Same Response and Different Re- 
sponse negative stimuli. This test 
was significant for Exp. II, (P < .02) 
and of borderline significance for 
Exp. I. (P < .05, one tail). 

The negative stimuli were identical 
in the two experiments. Table 4 
is the summary of a combined analysis 
of the adjusted numbers of CRs to 
the two negative tones only. The 
significant effect for Experiments indi- 
cates the obvious finding of greater 
primary generalization from a tone 
to other tones than from a light to 
tones. Since responses to the CS 
were omitted from this analysis, the 
F for the Generalization effect is a 
direct test of the asymmetry of 
generalization. It was highly signifi- 
cant for the two experiments com- 
bined, and the F for the interaction 
of the Generalization effect with 
Experiments was less than unity. 
The significant Generalization X Tone 
Arrangement interaction was pro- 
duced by the fact that there were 
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TABLE 4 
ANALYSIS OF ADJUSTED NUMBER OF CRs 
TO NEGATIVE STIMULI IN 
Exp. I anp II 


a 


Source 


= 


Between Ss 
Tone arrangement (T) 
Hand arrangement (H) 
E Pe (E) 
ie 
ic ° ; 
H XE 
TXHXE 
Error (b) 
Within Ss 
Generalizé ation (G) 


.03 

| 1.35 

10.21* 

1.37 

: .02 

28.39; .60 

49.22/ 1.04 
47.51 | 


as 
oO Me ee 


82. 10 | 15.49** 


+ 
i a a... 2) 


~ 


Error (w) 


o 
wn 


Total 


*P < .005. 
PP < O01. 


somewhat more CRs to the high tone 
than to the low tone in the counter- 
balanced arrangement. 

The assumption of the operation 
of both a facilitative effect and an 


associative, mediational effect leads 
to a prediction concerning the cor- 
relation across Ss between mean reac- 
tion time and the number of CRs 
for. the two negative stimuli. The 
facilitative effect should tend to pro- 
duce a negative correlation between 
these variables for both tones. For 
the Same Response stimulus, where 
the mediated equivalence effect is 
supposed to be operating, there should 
be a second source of negative correla- 
tion. That is, the earlier the mediat- 
ing response occurs, the greater its 
chance of eliciting the CR within 
the scoring interval. This should 
not be the case, however, for the 
Different Response stimulus. This 
source of negative correlation should 
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be absent, or, if a mediated distinc- 
tiveness effect is operating, there 
should be a source of positive correla- 
tion. This reasoning leads to the 
rather clear prediction that there 
should be a higher negative correla- 
tion between reaction time and the 
number of CRs for the Same Response 
stimulus. For Exp. I this correlation 
was —.24 for the Same Response tone, 
and +.18 for the Different Response 
tone. The corresponding correlations 
for Exp. Il were —.62 and —.44, 
respectively. Both differences were 
in the predicted direction. Employ- 
ing the test for significance of a differ- 
ence between related correlation coef- 
ficients, neither difference was quite 
significant. However, when the prob- 
abilities were combined by the Fisher- 
Pearson chi square procedure, the 
difference was significant for the two 
experiments combined (P = .044, two- 
tailed test). 


DISCUSSION 


It is the opinion of the authors that 
the results of these experiments should 
be interpreted with considerable caution. 
Since the facilitative effect is obviously 
a powerful one while the mediational 
one is relatively small, the danger of 
artifactual results must not be ignored. 
But in view of the important role as- 
signed to mediated generalization in the 
theoretical literature, it seems important 
to ascertain whether it does operate at 
the level of the conditioned response, 
which is commonly regarded as a rather 
fundamental form of associative learning. 
The analyses of the conditioning meas- 
ures, adjusted for their correlation with 
reaction time, appear to provide moder- 
ately good evidence that a mediational 
effect was operating in these experiments 
as well as the facilitative one. The 
verified prediction concerning the higher 
negative correlation between generaliza- 
tion and reaction time for the “equiva- 
lent” stimulus as opposed to the ‘‘distinc- 
tive’’ one provides corroborative evi- 
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dence. The 
hypothesis 


acceptance of the null 
concerning the interaction 
of the generalization effect with experi- 
ments is in accord with the prediction 
that the amount of mediated generaliza- 
tion should be independent of whether 
or not the generalized stimuli affect 
the same sensory modality as the CS. 

It is obvious to suggest that mediation 
should be studied in the absence of 
facilitative effects, since statistical con- 
trol of extraneous variables is 
completely satisfactory. In 
stances this may be possible. Yet if 
one wishes to study the associative 
consequences of a response at the time 
of its occurrence, one must be prepared 
to consider its other contemporary 
effects. This is not a problem specific 
to the present eyeblink experiments. 

The strong facilitative effect which has 
been found in this series of experiments 
is of some interest and importance in its 
own right. Of immediate interest, is the 
role which it has probably played in 
other eyeblink studies which have in- 


never 
some in- 


volved concurrent motor responses dur- 
ing conditioning. For example, the 
study of Voeks (1954), purporting to 
show step-wise conditioning curves, in- 
volved a sort of two-hand reaction-time 
response which she conceived to be 
part of the true CS. The closure of two 
keys presented an auditory CS, the keys 
then being held down against a “stiff 
spring’ for the duration of this signal. 
Responses were counted as CRs if they 
occurred anytime after the onset of the 
auditory stimulus. It would be difficult, 
design a_ situation 
calculated to facilitative 
effect. This is, in part, evidenced by the 
fact that her ‘‘conditioning”’ level reached 
approximately 35% on Trials 2 and 3 
and 60% on Trials 4 and 5. 
doubtful that 


indeed, to better 


produce a 


It is ex- 


tremely this situation 


involved conditioning to any important 


degree. Similar complications would be 
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involved in such studies as those of 
Miller and Cole (1936) and Beck (1939). 


SUMMARY 


In two differential eyeblink conditioning 
experiments, Ss made more CRs to negative 
stimuli accompanied by the same manual 
response as the CS, than they did to stimuli 
to which they made a different manual 
response. The faster the reaction time of 
the manual response, the more probable a 
CR, irrespective of which stimulus was 
involved. This was interpreted as evidence 
for a facilitation effect. The asymmetry of 
generalization still present, however, 
when the conditioning measures were adjusted 
for correlation with reaction time. This was 
interpreted as indicating the probable action 
of an associative, mediated generalization 
effect. The correlation between number of 
CRs and reaction time was significantly more 
negative for the “equivalent’’ generalized 
stimuli than for the ‘distinctive’ stimuli. 
This was also interpreted as indicating the 
presence of both facilitative and mediational 
effects. 

There was no evidence that the magnitude 
of the mediation effect was different when 
generalization was from one tone to another 
than when it was from a light to a tone. 

Implications of the facilitative effect for 
other eyeblink studies involving concurrent 
motor responses were indicated. 


was 
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FAMILIARIZATION IN VERBAL DISCRIMI- 
NATION LEARNING 
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Paired-associate and serial verbal 
learning may be conceived of as 
consisting of two stages. The first 
stage consists of delimiting the popu- 
lation of responses, i.e., learning what 
the responses are. The second stage 
consists of pairing each response with 
its correct stimulus or learning the 
correct order of these responses. 
Although logically the first stage 
must precede the second, it is likely 
that the two occur concomitantly 
in the ordinary learning situation. 

Several studies of the role of asso- 
ciation value and familiarization of 
syllables in paired-associate and serial 
learning have suggested that factors 
associated with responding are more 
important than “perceptual’’ factors 
in acquisition of verbal lists (e.g., 
Cieutat, Stockwell, & Noble, 1958; 
Mandler & Campbell, 1957; Noble, 
1952, 1955; Noble & McNeely, 1957). 
One hypothesis which may be ad- 
vanced concerning these results is 
that the primary influence of asso- 
ciation value and familiarization is in 
the first stage (the response-learning 
phase) of acquisition. According to 
this hypothesis, the letters making up 
a nonsense syllable must be integrated 
into a single response before cor- 
rect anticipations can occur (Mand- 
ler, 1954). Syllables of high associa- 
tion value and familiarity are con- 
ceived of as representing verbal habits 
established outside the experiment 
and thus are more readily integrated. 
The number of responses available 
on any given trial is then determined 
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by how far this integration stage 
has progressed. 

However, suppose a learning task 
is used in which the total number of 
available responses is determined by 
the task, rather than the rate of 
response integration. This may be 
accomplished by using a procedure 
called verbal discrimination learning, 
where S is presented with a pair of 
syllables and must select the response 
arbitrarily designated as ‘‘correct”’ 
from these two. The response popu- 
lation from which S must make his 
choice is thus restricted to the two 
syllables presented, and integration 
in the sense described above is un- 
necessary. It might then be expected 
that neither association value nor 
familiarization would have much ef- 
fect upon acquisition in this task. 

The following experiments are de- 
signed to investigate the role of both 
association value and familiarization 
on the verbal discrimination learning 
of nonsense syllables. 


EXPERIMENT | 


The first two experiments were 
concerned with the effects of associa- 
tion value. Two methods of varying 
association value are available for 
verbal discrimination learning: A pair 
may be made up of one high and one 
low association value syllable, or the 
two syllables making up a pair may 
both be high or both low association 
value. According to the hypothesis, 
there seemed to be no reason to favor 
one method or the other. Although 
this conclusion later appeared to be 
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in error, for the first experiment the 
first method 
was used. 


(heterogeneous pairs) 


Method 


Design and materials.—Two sets of eight 
pairs of syllables were constructed from 
Glaze’s lists (1928). In each pair one syllable 
was 80% to 100°, association value and one 
value. 
Formal similarity was kept to a minimum 
both within and between sets, and the same 
number of letters was duplicated within a set 


syllable was 0% to 27° association 


These sets were then combined to make a list 
of 16 pairs of syllables. For half of the Ss, 
the high association value syllables of Set I 
and the low association syllables of Set II 
were correct. For the other half of the Ss, 
the low association value syllables of Set | 
and the high association value syllables of 
This reduced the chances 
differential 
or interference with specific syl- 
was felt 


Set Il were correct 


of extraneous factors such as 
difficulty 
lables influencing the results. It 
that having both high and low 


value syllables correct in the same list would 


association 


avoid Ss recognizing, for example, that all 
the “‘word-like” 

Procedure.—The 16 pairs in each list were 
arranged in three orders and presented with 
a 16-mm. filmstrip projector. The pairs were 
flashed from the back of the screen so that S 
ould not sec 


ones were correct. 


E or the projector during the 
presentation illumination was re- 
duced to distraction. Syllables 
appeared black on a white background and 

1 They appeared on an 8 X 8- 


General 
minimized 


were } in. high. 
in. milk glass piece in the center of a 3 X 3-ft. 
The two syllables 
pre sented one above the 
followed by 
rhe intertrial interval was 4 sec. The posi- 
tion of the syllables in a pair varied from trial 
to trial. 
they 


wooden screen. were 


sec. 


other for 2 
the correct syllable for 2 


sec. 


15 trials unless 
three consecutive trials with no 
They were then stopped and given 
a perfect score for the remaining trials. The 
Ss, 20 volunteers from Hobart and William 
Smith Colleges, were instructed to select and 
spell aloud the syllable designated as correct 
on each presentation including the first trial. 
\ll Ss were naive to verbal learning experi- 


ments. 


All Ss were given 
went 
error. 


Results 


lhe percentages of correct anticipa- 
tions for the high association value 
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Fic. 1. Percentage of correct 
on each trial for high and 
value 
pairs) 


responses 
low association 
(heterogeneous 


syllables in Exp. | 


syllables and low association value 
syllables on each trial are shown in 
Fig. 1. The hypothesis that varia- 
tions in association value would not 
produce performance differences is 
not supported by these data. Sta- 
tistically, the mean total correct 
anticipations for the high association 
value syllables is significantly greater 
than for the low association value 
syllables (¢ = 3.65). Seventeen of 
the 20 Ss gave more correct responses 
to the high than to the low association 
syllables. 


EXPERIMENT I] 


Rather than completely reject the 
hypothesis on the basis of this single 
experiment, explanation was sought 
in terms of other factors associated 
with making the The 
facilitation in performance obtained 
here may have been due to greater 
ease of responding with -the high 
association syllables. Essentially this 
notion is what Noble (1955) has 
called “‘motor patterning.” If it is 
easier to learn to make the high 
association value responses than the 
low association value responses, some 
facilitation might be expected. This 
kind of response integration 
although only the motor aspect is 


response. 


is a 
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being referred to here. If the pairs, 
however, are constructed of two high 
or two low association value syllables, 
greater ease of responding to high 
association value syllables cannot 
determine which response is made, 
since one of the two responses must 
be made each time a pair is presented. 
The second study, then, compared 
verbal discrimination learning in which 
pairs were homogeneous, i.e., either 
both syllables had high or both syl- 
lables had low association value. 


Method 


The same syllables were used as in Exp. I, 
except that they were re-paired to form two 
sets of eight sylables in which Set I contained 
only high association value syllables and Set 
Il only low association value syllables. 
Duplicated letters were again kept at a 
minimum and equalized within each set. 
Two lists were used consisting of the two sets 
combined, with the correct member of each 
pair in List 1 being the incorrect member of 
the same pair in List 2. Half of the Ss learned 
each list. 

The Ss were 20 volunteers from Hobart 
and William Smith Colleges. All were naive 
with respect to verbal learning. The pro- 
cedure was identical to that in Exp. I. 


Results 

The acquisition curves for the high 
and.low association value pairs are 
presented in Fig. 2. Although there 
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Fic. 2. Percentage of correct responses on 
each trial for high and Jow association value 
syllables in Exp. II (homogeneous pairs). 
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is some evidence for better perform- 
ance on the high association value 
pairs, the differences are much smaller 
than those obtained in Exp. I. Com- 
parison of the mean total correct 
responses for the two kinds of syi- 
lables resulted in a t of 2.07, which is 
just short of significance at the .05 
level for 19 df. Although a constant 
number of trials was used, all Ss 
obtained all eight items within each 
set correct on at least one trial. 
Thus trials to 8/8 criterion may be 
used as a measure of acquisition rate. 
Seventeen of the 20 Ss reached this 
criterion first on the high association 
value pairs. A comparison of the 
mean trials to this criterion for the 
two kinds of pairs resulted in a ¢ of 
3.45. It thus appears that the differ- 
ence in performance shown in Fig. 2 
is reliable, although a small one. 

A comparison was made between 
Exp. I and II of the mean difference 
between the total correct responses 
to high and low association value 
pairs. The value of t obtained was 
1.73 which is only significant at about 
the .08 level. 


The results of Exp. I and II do not 
support the hypothesis that the facilita- 
tive effects of association 
attributable 
learning 


value can be 
solely to the response- 
Although there are 
several interpretations of these differ- 
ences, the one favored here is suggested 
by Underwood and Richardson (1958). 
High association value syllables may 
provide some sort of extra cue value, 
often called ‘‘meaning,’’ thus making 
them susceptible to interference 
from duplicated letters. Since in the 
lists used in these experiments it was 
impossible to avoid 
of letters, it is not 


phase. 


less 


some duplication 
unreasonable to 
assume that the performance on the low 
association items in Exp. II was de- 
pressed by the slight amount of inter- 
ference present. Some support for this 
position comes from the fact that a pilot 
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study using 10 homogeneous pairs in 
which there were no duplicated letters, 
except a few vowels, produced no differ- 
ences between high and low association 
value pairs (4 = 1.27 with 19 df). The 
experiments used lists of 16 
pairs because it was felt that acquisition 
10-item list to 
give any differences a chance to appear. 
Increasing the length of the list neces- 
the duplicated 
due to the restricted number of letters 
and syllables available, thus increasing 
interference. The difference 
in Exp. II, while not significantly smaller 
that Exp. I, 
support the hypothesis that the differ- 
the latter 
mostly due to differential ease of making 


present 


was too rapid with a 


sarily increases letters 


obtained 


than obtained in tends to 


ence in experiement was 


the response. 


EXPERIMENTS III AND IV 


The purpose of these two studies 
was to compare the effects produced 
by the familiarization of 
syllables (Noble, 1955) with 
produced by association value. 


nonsense 


those 


Method 


rhe rules for the 
construction of lists were the same as those 
in the previous experiments, with the excep- 
if 


tion that all syllables were from 0% to 27% 
association value (Glaze, 1928). 


Design and materials. 


The number 
of duplicated letters within a set was slightly 
higher for these lists than in Exp. I and II. 
As in the previous studies, a given syllable 
in each pair was correct for half the Ss and 
incorrect for the other half. 

Procedure.—Prior to learning the list each 
S was given familiarization training on half 
the syllables. 

Half the syllables were correct in learning 
and half the syllables were incorrect For 
Exp. III, one syllable was chosen from each 
pair ; for Exp. IV, both syllables of half of 
the pairs were used. Two different familiari- 
zation lists were employed in each experiment 
so that syllable difficulty, etc., was partially 
counterbalanced between the familiarized 
and nonfamiliarized syllables. In familiariza- 
tion training each syllable was exposed singly 
on the screen for 2 sec. The S was instructed 
to spell out each syllable as it appeared. The 
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Fic. 3. Percentage of correct responses 
on each trial for familiar and nonfamiliar 
syllables in Exp. III (heterogeneous pairs). 


16 syllables were arranged in six different 
orders, but there interval between 
orders. No syllable preceded or followed 
any other syllable more than once. Each 
syllable was seen and spelled 12 times by each 
S 


was no 


The verbal discrimination learning pro- 
cedure identical with that in Exp. I 
and II. Twenty volunteers from the Hobart 
and William Smith students 
in each study. None of the Ss had served 
in a verbal learning experiment before. 


was 


served as Ss 


Results 


The percentage of correct responses 
on each trial for familiarized and non- 
familiarized syllables in Exp. III is 
shown in Fig. 3. With heterogeneous 
performance level is 
the familiarized 

than when it is 


pairs 
when 
correct 


higher 
syllable is 
incorrect , 





20 3S 
oo 86 


Pm 
NON FAMILIAR 4-—9-* 
pe FAMILIAR 

is 


* 


N“ 
° 


/s 


~~ 


PER CENT CORRECT 
o. 8 


+ 
° 








= eee Se ee 
1234 567 8 9 101 i2 1314 15 
TRIALS 





Fic. 4 Percentage of correct responses on 
each trial for familiar and nonfamiliar syl- 
lables in Exp. 1V (homogeneous pairs). 
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with the advantage being particularly 
marked in the early trials. A ¢ test 
comparing the mean total correct 
responses for the two kinds of syl- 
lables resulted in a value of 3.78 
(P < .01). 

The acquisition curves for Exp. IV 
are presented in Fig. 4. In_ this 
experiment, however, there is no 
performance between 
pairs of familiarized and nonfamiliar- 
ized _ syllables. 


difference in 


Comparison of the 
mean total correct responses gave a 
t of .33, which is clearly not significant. 


DISCUSSION 


The effects of familiarization training 
on verbal discrimination learning are 
not the same as the effects of association 
value. The fact that familiarization 
produced facilitation in the case of 
heterogeneous pairs is consistent with 
the effects produced by association value, 
although there are some slight differ- 
ences. In the case of association value, 
performance on the two kinds of pairs 
starts at the same level, then diverges 
before converging again near the asymp- 
tote. Familiarization, on the other hand, 
produces large differences on the first 
trial. This large initial difference in 
the case of familiarization is probably 
produced by a tendency to guess and 
to keep guessing the syllables which have 
just been seen. This is clearly shown 
by the fact that performance on the 
nonfamiliarized syllables begins below 
the chance level (at 39%), while per- 
formance on the familiarized syllables 
begins at 57%. These results are also 
consistent with the hypothesis attri- 
buting facilitation to differential ease 
of responding for the two kinds of 
syllables. Since the differential ease 
of responding might not be established 
until after a few 
value 


trials when association 
varied, facilitation did not 
take place until later in learning, whereas 
familiarization training established these 
habits prior to learning the list. 

The striking result of these studies, 


was 
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however, is the failure of familiarization 
training to produce differences in per- 
formance when homogeneous pairs are 
used. This result is also consistent 
with the interpretation given the asso- 
ciation value experiments. If the differ- 
ences which occurred in the homogeneous 
pairs association value experiment are 
attributed to the cue value of association 
or “meaning,” familiarization training 
should not produce differences since 
it can be shown that familiarizing opera- 
tions, in general, do not increase asso- 
ciation value (Riley & Phillips, 1957). 
It might be argued that a difference 
would have been obtained in Exp. IV 
had familiarization training been con- 
tinued. This cannot, of 
denied; however, the 12 
here were sufficient to produce large 
differences in Exp. III when conditions 
for producing these differences were 
allowed. In fact, the purpose of Exp. 
I11 was to ensure that the familiarizing 
operations were sufficient to produce 
performance differences in one situation 
where these differences might be expected. 

It is felt that, over-all, the results 
support a response-oriented interpreta- 
tion of the effects of association 
and familiarization, although they do 
indicate some cue effect of association 
value. The results of Exp. IV represent 


course, be 
trials given 


value 


the only real support for the original 
predictions from the response integra- 
tion hypothesis that association value 
and familiarization would not facilitate 
verbal discrimination learning. 


Never- 
theless, an attempt made to 
that complicating factors 
present in the case of 
pairs. 


has been 
show were 


heterogeneous 


SUMMARY 


Four verbal] discrimination learning experi- 
ments were performed to test the hypotheses 
that association value and familiarization 
with the syllables would not facilitate per- 
formance. Two experiments varied associa- 
tion value and two gave familiarization train- 
ing on the syllables. In one experiment each 
pair of items to be learned consisted of one 
high and one low association value syllable 
(heterogeneous pairs), while in the other 
experiment, both syllables were either high 
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or low association value. The same held true 
for the familiarization studies. 

The results showed that both association 
value and familiarization facilitated per- 
formance with the heterogeneous pairs, while 
only association value facilitated performance 
with the homogeneous pairs. These results 
were discussed in relation to an interpretation 
of the role of association value and familiari- 
zation in learning in terms of their effects on 
response availability 
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The theoretical importance of com- 
peting responses has been repeatedly 
emphasized in the past (Guthrie, 
1935; Wendt, 1936), and data are 
beginning to accumulate indicating 
that competing responses do affect 
ordinarily measured behaviors in im- 
portant ways. Cotton (1953) showed 
that a major part of the effect of 
drive on asymptotic running speed 
was due to the intervention of com- 
peting responses. When trials on 


which competing responses occurred 
were not analyzed, drive did not have 
as great an effect on performance as 
it did when competing response trials 


were included in the analysis. Pere- 
boom and Crawford (1958) showed 
essentially the same effect for mag- 
nitude of reward and number of 
acquisition trials. 

Adelman and Maatsch (1955, 1956) 
have shown that controlling the 
competing responses during extinc- 
tion results in a marked effect upon 
behavior. When they allowed Ss to 
climb out of the goalbox, running 
speeds were increased during extinc- 
tion. This finding was confirmed by 
Cotton, Lewis, and Metzger (1958) 
and extended to acquisition behavior 
for both the runway and the T maze. 

Sheffield (1950), Stanley (1952), 
Amsel (1958), and others have em- 

1This study was supported by a grant 
from the National Science Foundation. 

The writers wish to express their apprecia- 
tion for the work of Glen Jensen who par- 
ticipated in all phases of the present investiga- 
tion. 


phasized the drive-arousing properties 
of the extinction operation, and others 
(Spence, 1956; Taylor, 1951) have 
indicated the connection between 
drive and competing responses. 

There seems little doubt, then, 
about the importance of competing 
responses. And the effect of compet- 
ing responses in conjunction with 
drive is also of interest. The present 
paper is a report of an experiment 
in which two conditions of drive and 
two conditions of competing responses 
were manipulated factorially. The 
drive conditions were established by 
hours of food deprivation; the com- 
peting response conditions were estab- 
lished by two runway sizes, large (L) 
and small (S). It was assumed that 
the L condition would permit many 
competing responses, whereas the S 
condition would tend to prevent the 
competing responses from occurring. 
In the L runway S could turn around, 
scratch himself, move from one side 
of the alley to the other, and perform 
many responses other than running 
to the goalbox. The S runway, it was 
believed, would prevent most of these 
competing responses. 


METHOD 


Subjects.—The Ss were 80 naive male 
albino rats of the Sprague-Dawley strain. 
They varied from 66 to 81 days of age, with 
a mean of 74.3 days. Because it was necessary 
to make structural adjustments in the ap- 
paratus for different experimental groups, 
departures from purely random assignment 
of Ss were necessary. When Ss arrived, they 
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were assigned by a table of random numbers 
to their individual living cages. Then they 
were assigned to the experimental conditions 
in blocks of four with each block of four 
serving under identical conditions. Four 
Ss were run in each of the four experimental 
conditions before any one of the experimental 
conditions received its second assignment 
of four Ss. Replications continued in this 
fashion until 20 Ss were assigned to each 
of the experimental conditions. Four Ss 
were discarded because of a procedural error, 
and they were replaced. 

A pparatus.—The apparatus was a straight 
runway, the inside dimensions of which could 
be altered by putting in or taking out Plexi- 
glas inserts. Two adjustments of the inside 
dimensions were employed. In one case the 
runway was set at its maximum width and 
depth. In this L condition the inside dimen- 
sions of the runway proper were 83 in. high, 
4 in. wide, and 4 ft. long. 
sions of the goalbox were 84 in., 9 in., and 
104 in. The goalbox also contained a recess, 
2} in. by 1} in., in which the foodcup was 
placed so that it could not be seen before a 
complete goalbox entry. For the startbox 
the dimensions were 84 in., 4 in., and 6 in. 
The inside dimensions of the startbox for the 
S condition were 2} in., 12 in., and 6in. For 
the runway proper the dimensions were 2} in., 
1? in., and 4 ft. The goalbox was 2} in. high 
but remained unchanged in the other dimen- 
The interior of the 
painted flat black. 
fined as the interval required from the open- 
ing of the startbox door until S broke a photo- 
electric beam 2 in. from the goalboy. The 
distance between the startbox and the photo- 
electric beam was 444 in. Times were taken 
automatically by a Standard Electric timer, 
calibrated in .01 sec. 

Design.—Two variables were manipulated 
factorially. One was the number of hours of 
food deprivation and the other was the inside 
dimensions of the apparatus. One-half of the 
Ss were assigned to the L runway for acquisi- 
tion and extinction, and one-half were 
assigned to the S runway. One-half of the Ss 
run in the L apparatus were deprived ot food 
for 2-3 hr. at the beginning of any experi- 
The remaining half of the 
Ss were deprived approximately 22 to 23 
hr. The same deprivation conditions were 
applied to Ss run in the S apparatus. 

Deprivation and taming.—All Ss _ were 
placed on their appropriate deprivation 
schedule at least seven days before the initia- 
tion of experimentation. The Ss were handled 
for at least 5 mio. on each of these seven days 


The inside dimen- 


sions. apparatus was 


Response time was de- 


mental session. 
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and made familiar with the 45-mg. lab chow 
pellets to be used during the acquisition 
phase of the experiment. 

Acquisition.—Each S received 10 trials 
a day for five days. Then two days intervened 
during which S received no trials, but was 
maintained on the appropriate deprivation 
schedule. There followed another 10 trials 
a day for five days. A total of 100 acquisition 
trials was given with approximately a 15-sec. 
intertrial interval and’ with a 45-mg. pellet 
available in the goalbox on each trial. The 
Ss were detained in the goalbox approximately 
10 sec. on each trial. 

Extinction.-Extinction began two days 
after the one-hundredth acquisition trial. 
During these two days Ss were maintained 
on their appropriate deprivation schedule. 
Ten nonreinforced trials a day for five suc- 
cessive days were given during the extinction 
period. If on one of these trials S did not 
reach the goalbox within 1 min., he was re- 
moved from the apparatus and the response 
was recorded as 1 min. The intertrial interval 
remained at approximately 15 sec., and Ss 
continued to be detained in the goalbox for 
approximately 10 sec. on each trial 


RESULTS 


Acquisition.—The reciprocal of the 
running time (speed) for each trial 
was determined for each S, and all 
analyses were performed on 
speeds. 


these 
A graph of the mean speed 
by 10-trial blocks is presented in 
Fig. 1. An over-all analysis of vari- 
ance was performed on the means 
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Fic. 1. The relationship between acquisi- 
tion trials and running speed. Each of the 
10 points on the abscissa represents the mean 
of 10 trials. Speeds are the reciprocals of 
running time in seconds. 
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of the 100 acquisition trials. The 
apparatus size was found to be 
not an effective variable (F = .32, 
df = 1/76). Drive was significant 
(F = 15.18, df = 1/76); and there 
was no interaction between apparatus 
size and drive (F = .03, df = 1/76). 
Figure 1 shows the clear superiority 
of the high-drive groups throughout 
acquisition. It also seems clear from 
Fig. 1 that the two drive conditions 
resulted in different asymptotes. The 
slight dip in all four acquisition curves 
just after Trial 50 can probably be 
explained by the two-day interval 
after Trial 50 during which Ss 
received no training. 

The over-all analysis of variance, 
which shows apparatus size not to be 
an effective variable, is probably not 
completely appropriate for these data 
because the analysis includes values 
for all trials in the computations of 
means for each group. The initial 
elevation of the S groups is counter- 
weighted by the later elevation of the 
L groups, and the over-all analysis 
is insensitive to this change as a 
function of trials. Therefore, another 
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analysis of variance was performed 
with trials as an added variable. 
The means of 10-trial blocks 31-40, 
41-50, 51-60, and 61-70 were taken 
for each S and these scores made up 
the added variable of trials. This 
analysis confirms what Fig. 1 seems 
to show: that the effect of appara- 
tus size is a function of the number 
of acquisition trials. The F for the 
interaction between apparatus size 
and trials was 5.53. A value of 3.97 
is required for significance at the .01 
level with 3 and 114 df (the pooled 
Ss X Trials served as an error term). 

Perhaps, because of its post hoc 
nature, this “repeated measurements” 
analysis capitalized on observed chance 
variation in the data. The alterna- 
tive is to be blinded by a poor first 
guess as to the appropriate analyses. 
The final test of any empirical finding 
is its reliability over several experi- 
ments. 

A record was kept of all the trials 
on which competing responses oc- 
curred. A competing response was 
defined as any behavior which did not 
result in forward progress through 
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rABLE 1 


ANALYSIS OF THE CHANGE IN COMPETING 
RESPONSES WITH TRIALS DURING Ac- 
QUISITION AND EXTINCTION 


Acquisition Extinction 


Drive (D) 

Apparatus size (A) 

DXA 

Between Ss in the 
same group 

Trials (T) 

rxvD 

rT XA 

TXDXA 

Pooled Ss X T 


P= MM. 


the maze. Such behaviors as retrac- 
ing, face washing, and scratching 
were ruled as competing responses. 
Figure 3 the percentage of 
trials on which one or more competing 
responses occurred as a function of 
the manipulated variables, and Table 
1 shows the result of “repeated 
measures”’ analysis of these data. 
acquisition none of 


shows 


the 


During 
variables resulted in significant dif- 


ferences by themselves, but they 
both interacted significantly with 
trials. In addition, there was a 
significant triple interaction of the 
two variables with trials. If the 
mean square for the triple interaction 
is used as the error terms in testing 
the other differences, only the F 
for Trials remains significant. The 
Trials F means that the number of 
competing 
acquisition proceeds. 

The triple interaction seems to be 
attributable to the differential effect 
over trials of low drive, depending 
on the apparatus that is used. In 
the L apparatus the low-drive group 
yields a higher percentage of trials 
with competing responses early in 
acquisition than does the high-drive 
group. Late in acquisition, however, 
the low-drive group in the large 


responses decreases as 
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apparatus has very few competing 
responses, whereas the low-drive group 
in the small apparatus has many. 

To state the meaning of this triple 
interaction in another way : Apparatus 
has no differential effect over trials 
for the high-drive group. And, in 
general, fewer competing responses 
occur in the S apparatus throughout 
acquisition. The effect of drive, 
however, is reversed from early to 
late acquisition for the L apparatus 
groups. Early in acquisition more 
competing responses occur in the L 
apparatus, and late in acquisition 
more competing responses occur in 
the S apparatus. Decreasing appara- 
tus size decreased the percentage of 
trials on which competing responses 
occurred only very early in acquisi- 
tion for the low-drive group. Other 
than that, more competing responses 
occurred in the S apparatus. 

Extinction.—An_ over-all analysis 
of variance of the extinction running 
speeds was performed. Both Drive 
(F = 19.1 df = 1/76) and the Ap- 
paratus (F = 13.5, df = 1/76) were 
highly significant variables. There 
was no interaction between the two 
(F = .05, df = 1/76). The _ inter- 
pretation of this analysis can be 
obtained from Fig. 3. This figure 
indicates that the S apparatus re- 
sulted in increased resistance to ex- 
tinction, and that the greater drive 
also resulted in prolonged extinction. 
An analysis of covariance in which 
the mean of the last 10 acquisition 
trials was used as the control variable 
and the mean of all extinction trials 
as the prediction variable gave ap- 
proximately the same results. The 
F for the apparatus was increased 
to 28.86 for 1 and 75 df, however. 
An analysis of covariance, again with 
the mean of the last 10 acquisition 
trials as the control variable, but 
with the mean of the last 10 extinc- 
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EXTINCTION TRIALS 


Fic. 3. 


The relationship between extinction trials and running speed. 


Points 1 to 2, 3 to 4, 


5 to 6, 7 to 8, and 9 to 10 represent blocks of 10 extinction trials given on each of five days. 
Points 2 to 3, 4 to 5, 6 to 7, and 8 to 9, represent spontaneous recovery between 10-trial 


extinction blocks. 


tion trials as the predicted variable, 

shows the Apparatus as remaining 

significant (fF = 10.10, df = 1/75). 
Finally, a repeated measurements 


analysis was performed using the 
mean of the last 10 acquisition trials, 
the mean of extinction Trials 21 to 
25, which form the midpoint of the 
extinction trials, and the mean of 
the last 5 extinction trials as points 
for the trend. A summary of this 
analysis is presented in Table 2. 
Drive alone remains significant; Ap- 
paratus is not significant. Because 
all the other extinction analyses show 
Apparatus to be a highly significant 
variable, this low F value is perhaps 
surprising. The reasons for the 
nonsignificant F for Apparatus can 
be seen by looking at Fig. 1 and 3 
jointly. At the end of acquisition, 
shown in Fig. 1, the mean of the S 
conditions is below that of the, L 
conditions. During extinction, shown 


in Fig. 3, just the reverse is true; the 
mean of the S conditions 
that of the L conditions. This 
reversal cancels out the over-all 
effect of the apparatus. The reversal 
also means that there should be a 
significant interaction of apparatus 
with trials. The very significant F 


is above 


TABLE 2 


ANALYSIS OF THE CHANGE IN EXTINCTION 
RUNNING SPEEDS WITH TRIALS 


Source 
Drive (D) 


Apparatus size (A) 

DXA 

Between Ss in the 
same group 

Trials (T) 

TXD 

% MA 

t2e KS 

Pooled Ss X T 


NN bd he bh 


_ 
ws 


*P = 01. 
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for the Trials X Apparatus interac- 
tion confirms this interpretation. The 
S apparatus resulted in slower running 
speeds at the end of acquisition but 
in faster running speeds throughout 
most of extinction. 

The interaction between drive and 
trials is also significant. This sig- 
nificance is attributable to the general 
decrease in the effectiveness of drive 
as extinction progresses. 

Figure 3 and an analysis of variance 
that during 
competing responses occurred in the 
lL. apparatus, and that drive had no 
effect on competing responses. 

Spontaneous recovery.—Ten extinc- 
tion trials were run on the first ex- 
tinction day, and 10 trials a day were 
given for four succeeding days. To 
determine the amount of spontaneous 
recovery, the difference between the 
speeds on the tenth extinction trial 
and the eleventh extinction trial was 
computed, and an analysis of vari- 
ance performed on these difference 
scores. Drive was very significantly 
effective (F = 10.4, df = 1/76), Ap- 
paratus size was not (F = 1.2), and 
there was no significant interaction 
(F = .14). 
3 shows clearly that the higher the 
the the 


recovery . 


show extinction more 


between the two Figure 


drive, more spontaneous 


DISCUSSION 


\s in a study by Cotton, Lewis, and 


Metzger (1958), a wide runway was 
accompanied by an increased number 
of trials on which competing responses 


Because more competing re- 
usually 


occurred. 


sponses meant slower running 


times, rapid extinction corresponded 
with an increase in competing responses. 
The two measures are, however, to some 
extent independent. The S, for example, 
can run fast or slow on trials on which 


Also, 


it is possible for a competing response 


no competing responses occur. 


to occur on a fast trial as well as on a 
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slow trial. Therefore, it is meaningful 
to conclude that competing responses 
are at least one determiner of extinction, 
as measured by decreased running times, 
and this conclusion is in line with a num 
ber of interference theories of extinction. 

That the S 
increased the 


apparatus, in general, 
number of 
responses was a surprise, although there 
was at least a suggestion of such a result 
in the previous Cotton, 
Metzger study. Evidently 
size has a different effect on competing 
responses during acquisition than during 
And it is clear that 
little bigger 
body, does not necessarily curtail com 
The initial plan to 
manipulate competing responses by re 
stricting the area for locomotion was a 
failure. 


competing 


Lewis, and 


apparatus 


extinction. a small 


apparatus, than a rat’s 


peting responses. 


SUMMARY 


Two conditions of drive—2 and 22 hi 
of food deprivation 1 fac- 
torially with a large and a small runway in an 
attempt to determine some of the effects 
of drive on competing responses. The length 
of the runway was the same in both apparatus 
conditions, but the small runway was con- 
stricted greatly in other dimensions in an 
attempt to reduce competing 
Twenty animals were assigned to each of the 
four conditions. They were given 100 acquisi- 
tion and 50 extinction trials. 


were combines 


responses 


The Ss were 
run under the same experimental conditions 
during extinction that they had experienced 
during acquisition. 

The major findings were: 


1. High-drive resulted in faster running 
speeds throughout 
extinction. 

2. Early in acquisition the small apparatus 
resulted in faster speeds; there is a reversal 
of this trend later in acquisition. 

3. The apparatus 
number of trials on which competing responses 
occurred only for the first few trials of acquisi- 
tion and then only for the low-drive groups 
Other than this, more competing responses 
occurred in the small 
acquisition. 


acquisition and early 


small decreased the 


apparatus during 

4. During most of extinction the small 
apparatus produced greater running speeds 
than the large, although this 
was lost by the end of extinction, 


superiority 
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5. The high-drive condition resulted in 
greater spontaneous recovery. 

6. In extinction, the small apparatus 
served to reduce competing responses. 
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LEARNING AND RETENTION OF PAIRED ADJECTIVES 
AS A FUNCTION OF PERCENTAGE OCCURRENCE 
OF RESPONSE MEMBERS 


RUDOLPH W. SCHULZ! anp WILLARD N. RUNQUIST? 


Northwestern University 


If the paired-associate verbal learn- 
ing situation is considered an analogue 
of the classical conditioning situation, 
then the presentation of the response 
member of the S-R pairs corresponds 
to the presentation of the UCS and 
reinforcement. Therefore 
in percentage occurrence of response 
members in the paired-associate situ- 
ation should be analogous to variation 
in the reinforcement 
in the conditioning situation. 


variation 


percentage of 


At the time the present study was 
begun, percentage occurrence of re- 
members’ (hereafter abbrevi- 
ated ORM) had not been investigated 
in paired-associate verbal learning. 
Recently, Goss, Morgan, and Golin 
(1959) have reported that the mean 
number of trials to learn paired non- 
sense-syllable lists to criterion de- 
creased as percentage ORM increased. 
Adjective lists were used in the present 
experiment to investigate the ORM 
dimension at five points; namely, 
20%, 40%, 60%, 80%, and 100% 
ORM. 

It is well established 
mittently reinforced are 
typically more resistant to extinction 
thancontinuously reinforced responses. 
If an unlearning (Melton & Irwin, 
1940; Melton & von Lackum, 1941) 
extinction-like 1948) 
process underlies forgetting, at least 
in part, then it is conceivable that a 
paired-associate list learned with in- 


sponse 


that inter- 
responses 


(Underwood, 


! Now at Carnegie Institute of Technology, 
Pittsburgh, Pennsylvania. 

2Now at 
California, 


Pomona College, Claremont, 


termittent ORM will be more resistant 
to forgetting than a list learned with 
continuous ORM. Therefore a second 
purpose of the present experiment 
was to study retention as a function 
of percentage ORM. 


METHOD 


List construction.—All Ss in all conditions 
learned the 10 pairs of adjectives. 
Intralist similarity among these pairs was 
low. Details concerning the original selection 
of the pairs may be found elsewhere (Under- 
wood, 1951). Five lists, differing only with 
respect to percentage ORM, were constructed 
utilizing these 10 pairs of adjectives. The 
method of manipulating percentage ORM 
was essentially identical with that used by 
Goss, Morgan, and Golin (1959); hence there 
is no need to detail it here. In short, the 
occurrence of the response members of the 
adjective pairs was scheduled so that per- 
centage occurrence per trial for a given list 
was the same as the percentage occurrence 
for a given pair in the list within successive 
blocks of five trials. For example, with 20% 
ORM, 2 of the 10 stimuli were accompanied 
by their response members on each trial. 
The remaining eight stimuli were accom- 
panied by a blank. On one of the five trials 
within each successive block of five trials each 
stimulus was accompanied by its response 
member once or 20% of the time. While 
Goss et al., used an 8-item list of nonsense 
syllables with 25%, 50%, 75%, and 100% 
ORM, the present study employed a 10-item 
list of adjectives with 20%, 40%, 60%, 80%, 
and 100% ORM. 

Subjects and procedure.—The 20%, 40%, 
60%, 80%, and 100% ORM lists were learned 
by five different groups of 50 Ss each. The 
250 Ss were students at Northwestern Univer- 
sity taking introductory psychology courses. 
They were naive to the learning of paired 
adjectives, although some had served in other 
verbal learning experiments. The Ss were 
assigned to lists in rotation as they appeared 
at the laboratory. 


same 
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The Ss learned by the anticipation method. 
All Ss were given standard paired-associate 
learning instructions. In addition, Ss learn- 
ing the intermittent ORM lists were warned 
that the response members of the pairs would 
be omitted a certain proportion of the time 
and that they should do their best in spite of 
this handicap. 

The lists were presented on a Patterson 
memory drum at a rate. The 
stimulus term was presented for 2 sec. while 
a shutter covered the response term. Then 
the shutter lifted and both the stimulus and 
the response (or a stimulus and a blank on 
non-ORM trials) remained in the aperture 
for an additional 2 sec. The intertrial inter- 
val was 4 sec. The criterion for learning and 
relearning was one perfect recitation of the 
list. All relearning was with the 100% ORM 
list. The Ss reported to the laboratory at 
the same time on two successive days. Ac- 
quisition took place on the first day and re- 
learning on the second day. Hence the aver- 
age interval between the end of acquisition 
and the beginning of relearning was approxi- 
mately 23.5 hr. 

A total of 24 Ss had to be eliminated from 
the experiment for various reasons. Elimina- 
tion rate was not differential with respect 
to conditions, except in the case of failure to 
understand the instructions for learning. The 
seven Ss dropped for this reason were all in 
either the 20%, 40%, or 60% ORM groups. 
Therefore, if ability to understand instructions 
is highly correlated with ability to learn paired 
adjectives, then the sample of Ss in these 
groups may be slightly superior to the Ss 
in the 80% and 100% ORM groups. 

Statistical analysis —Analysis of variance 
was used when the scores under consideration 
met the requisite assumptions. The Fs have 
4 and 245 df in each comparison, with an F of 
2.37 required for significance at the 5% level. 
If either heterogeneity of variances or indica- 
tions of nonnormality were present, then the 
median test (Dixon & Massey, 1957, p. 296) 
was used. The x? in this test has 4 df in 
each comparison, and a x? of 9.49 is required 
for significance at the 5% level. 


2:2-sec. 


RESULTS 


Acquisition.—The present results, 
for measures identical to those re- 
ported by Goss et al. (1959), are 
shown in Table 1.4 Their results for 


3 A third measure, mean reinforced correct 
responses, was reported graphically by Goss 
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TABLE 1 


COMPARISON OF Goss, MORGAN, AND GOLIN 
RESULTS WITH SCHULZ AND 
RUNQUIST RESULTS 


| 
Means 


; Percentage — 
Study | ORM ' 
Trials | ,Correct 
. Response 
Goss, Morgan, 106 
& Golin 7 34.3 
(N = 20) 5¢ 40.6 
) 


h 40.5 


24.8 


Schulz & 100 
Runquist 80 
N = 50) 60 

40 
20 


* om based on within groups variance estimate 


for trials =1.22, and for correct responses =5.62. 
Corresponding Fs were 4.07 (P <.01) and 2.68 
(P < .05). 


8-item syllable lists learned to a 7 
criterion at a 2: 2-sec. rate of presenta- 
tion are also presented in Table 1 to 
facilitate the comparison of the two 
studies. For the present study, it may 


be seen from Table 1 that the mean 
number of trials and correct responses 
in learning to criterion generally 
decreased as percentage ORM _ in- 
creased. The ‘“deviant’’ performance 
of the 60% ORM group is the only 
exception to an otherwise monotonic 
relationship for both measures. The 
differences among these means are sta- 
tistically reliable (for trials F = 4.07, 
df = 4/245, P < .01; for correct re- 
sponses F = 2.68, P < .05). It is 
also apparent from Table 1 that the 
present results agree well with those 
obtained by Goss et al., though the 
differences between the means in the 


et al., (1959). Since these means are readily 
estimated from the data in Table 1 by mul- 
tiplying the mean number of correct responses 
for a given group by the proportion of ORM 
it received, they are not reported here. In 
accord with Goss’ finding, the present mean 
number of reinforced correct 
creased as percentage ORM increased. 


responses in- 
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present study are generally of smaller 
magnitude. 

[t was our impression, while run- 
ning Ss, that Ss in the 20% ORM 
group seemed less likely than 100% 
ORM Ss to fail to anticipate an item 
after once having anticipated it cor- 
rectly. A score designed to show 
whether our impression was, in fact, 
a valid one was computed for all Ss. 
This score, hereafter referred to as 
ratio after 
first correct, was computed as follows. 
The S’s data sheet was inspected to 


the Correct/Opportunity 


determine when the response member 
of a given pair of 
anticipated correctly. 
ber of subsequent 


was first 

Then the num- 
opportunities S 
had to respond to the stimulus mem- 
ber of that pair were noted as well 
as the number of times he responded 
This was done for all 10 
pairs on the list. Next, the total 
number correct was divided by the 
The 
Correct/Opportunity ratio after first 
correct is the result of this division. 
Thus, the less frequently S misses 
items after anticipating them correctly 
once, the higher his score. The results 
of this analysis are shown in Fig. 1. 
Inspection of Fig. 1 that 


items 


correctly. 


total number of opportunities. 


reveals 
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Fic 1. Mean Correct/Opportunity ratios 
after first correct as a function of percentage 


of occurrence of response members. 
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performance in terms of mean Cor- 
rect/Opportunity ratio after first cor- 
rect was a monotonic inverse function 
of percentage ORM. With 20% 
ORM there were, on the average, 6% 
more correct responses per oppor- 
tunity than with 100% ORM. Sta- 
tistically, the differences among the 
means of Fig. 1 may be considered 
reliable (x? = 13.8; P < .01). The 
standard error of these means ranged 
from .010 to .014. 

The mean number of overt errors 
per trial during acquisition was di- 
rectly related to percentage ORM. 
The respective means for 100%, 80%, 
60%, 40%, and 20% ORM were 1.09, 
1.03, .87, .69, and .58. The differences 
among these means meet the re- 
quirements for statistical significance 
(F = 6.22; P < 01). The standard 
error of these means is .085. 

Retention.—The mean number of 
correct responses on the recall trial 
did not vary as‘a function of per- 
centage ORM. The means were 6.38, 
6.84, 6.58, 6.50, and 6.62 for 100%, 
80%, 60%, 40%, and 20% ORM, 
respectively. The o,, ranged from .24 
to .26. Clearly, the null-hypothesis 
could not be held suspect (x? = 1.92; 
P > .80). Adjustment for possible 
differences in response strength at the 
end of acquisition were made via prob- 
ability analysis (Underwood, 1954), 
but the conclusion that percentage 
ORM did not affect retention re- 
mained unaltered. The relearning 
data mirrored the recall data and 
therefore need not be presented. 


Discussion 


Acquisition.—The fact that the present 
differences in performance as a function 
of percentage ORM were of much smaller 
magnitude than those obtained by Goss 
et al. (1959) can most probably be at- 
tributed to the difference in the level 
of meaningfulness of the learning ma- 
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terial employed in the two studies. This 
seems especially likely because meaning- 
fulness is known to be a potent source of 
variance in verbal learning (Underwood 
& Schulz, 1960). Therefore, it appears 
reasonable to infer from these studies 
that the effectiveness of variation in 
percentage ORM may be _ inversely 
related to the meaningfulness of the 
material to be learned. 

The general failure to find more 
marked decrements in performance with 
reduced ORM surprised us, as it did 
Goss et al. The Ss in the present 20% 
ORM group saw the response members 
of the list an average of 4.5 times in 
learning to criterion as contrasted with 
17.5 exposures of each response member 
with 100% ORM. Goss has pointed 
out some factors which may have 
counteracted the effect of reduction in 
ORM. The present results seem perti- 
nent to two of these factors; namely, 
amount of opportunity for rehearsal 
and amount of response competition per 
trial. 

Consideration of the inverse relation- 
ship between percentage ORM and 
Correct/Opportunity ratio after first 
correct (cf. Fig. 1) in conjunction with 
the data on trials to learn to criterion 
suggests that the detrimental effects of 
reduced ORM reflected by the latter 
measure must have been confined to the 
relatively earlier stages of learning which 
primarily involved the accomplishment 
of the first correct anticipation of a 
given response member, since mainte- 
nance of the response thereafter proceeds 
better with intermittent ORM. The 
differential opportunity for rehearsal 
is also greatest during the early stages 
of learning and decreases steadily as S 
attempts to anticipate more and more 
response members in the low ORM 
condition. Therefore, rehearsal is un- 
doubtedly an important factor in the 
attenuation of the effectiveness of reduced 
ORM. 

Finally, if the occurrence of overt 
errors can be taken as an index of 
response competition, then the direct 
relationship between frequency of overt 
errors per trial and percentage ORM 
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found in the present study suggests that 
response competition does in fact increase 
as percentage ORM increases. Hence, 
the second important source of com- 
pensation for the reduction of ORM 
must be associated with reduced response 
competition throughout learning under 
conditions of intermittent ORM. Un- 
fortunately, the present data fail to 
provide any clues as to a possible mecha- 
nism by means of which intermittent 
ORM could reduce competition. 

Retention.—The retention results un- 
equivocally failed to support the hypoth- 
that intermittent ORM increases 
resistance to forgetting under the condi- 
tions of the present experiment. Per- 
haps the present experiment was simply 
a poor one for a test of the hypothesis. 
A better test might be provided by an 
experiment in which the _ traditional 
A-B, A-C retroaction paradigm was 
utilized to study amount of retroactive 
inhibition as a function of percentage 
ORM in original learning. 


esis 


SUMMARY 


The effect of variation in percentage of 
occurrence (20%, 40%, 60%, 80%, 100%) 
of response members (ORM) on the learning 
and retention of paired-adjective lists was 
studied. A total of 250 Ss, 50 per list, learned 
the same list under the five ORM conditions 
to a criterion of one perfect recitation. Re- 
tention was measured 23.5 hr. after acquisi- 
tion. The following results were obtained: 

1. The mean number of trials and correct 
responses required to reach the criterion 
decreased as percentage ORM increased. 

2. Performance subsequent to the first 
correct anticipation of a given pair was an 
inverse function of percentage ORM. That 
is the ability to maintain a correct associa- 
tion, once made, was greater with intermittent 
than with continuous ORM. 

3. The mean number of overt errors per 
trial increased as percentage ORM increased. 

4. Mean number of correct responses on 
the recall trial was unaffected by variation 
in percentage ORM during acquisition. 
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INTERPOLATED ACTIVITY ! 
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Bilodeau and Bilodeau (1958) re- 
cently published results from five 
studies on delay of knowledge of 
results (ICR) in motor learning tasks. 
In ‘none of five studies was 
delay a significant variable. Delays 
hour, day, and week 
produced only null results in lever- 
pulling and knob-turning tasks. Null 
results were not surprising, for the 
responses learned were relatively sim- 
ple and discrete, no special inter- 
fering activities were interpolated, 
and potentially confounding temporal 
variables were carefully controlled, 
i.e., intertrial interval and post-KR 
interval. 


these 


by seconds, 


Moreover, it was shown that these 
results were well supported by the 
literature. There was, however, one 
outstanding exception that proved 
difficult to’explain without introduc- 
ing petty criticisms. This was the 
study by Greenspoon and Foreman 
(1956) that not only produced signifi- 
cant results but also yielded a hand- 
some set of delay curves with 0-, 10-, 
20-, and 30-sec. parameters. Green- 
spoon and Foreman’s conclusion 
‘The results of this experiment demon- 
strate the efficacy of immediate knowl- 
edge of results in the learning of a 
motor task,’”’—was quite at variance 
with that of Bilodeau and Bilodeau. 
It was judged that by far the best 
way to deal with the conclusion was 
to hypothesize that the findings could 
not be replicated. 


' This work was supported by a grant from 
the National Science Foundation. 


The essence of Greenspoon and Foreman’s 
method was as follows. The Ss were taught 
to draw 3-in. straight lines while blindfolded. 
KR was “long,”’ “‘short,”’ or ‘‘right.””. There 
were eight Ss in a group; each S drew 50 
lines at intervals. To Greenspoon 
and Foreman, the critical part of the pro 
cedure was that after drawing a line S re- 
tained his grip of the pencil, kept hand and 
arm free of the table, and allowed E to return 
the hand to the starting point at the end of the 
30-sec. intervals. The foregoing, called 
“hand-maintaining activity,’ required S to 
keep his arm off the table for the whole 25- 
min. testing and must 
quite tiring. 

In an earlier study by Saltzman, Kanfer, 
and Greenspoon (1955) in which delay of KR 
had no effect on line drawing, the procedure 
was different in two respects. First, S rested 
his elbow on the table and briefly raised his 
pencil when the line was finished. (Again, 
E returned S’s hand to the starting point.) 
Second, the point at which the response 
originated was only approximately the same 
from trial to trial. Greenspoon and Foreman 
allow the possibility that either procedural 
difference may account for the discrepancy 
of outcome. As they put it, (a) delayed 
information may differentially reinforce hand- 
maintaining activity so that longer delays 
of information are less effective than shorter 
delays, and (b) Saltzman’s et al. slightly 
variable starting point made enough of a 
stimulus change from trial to trial to reduce 
the effects of delay. 


30-sec. 


session have been 


If we were convinced of the relia- 
bility of the above differences in out- 
come, we might offer arguments about 
the interpretation, but 
disagreement is over fact, not inter- 
pretation. It is difficult to persuade 


our basic 


us, with a single experiment, that 
such procedural differences as de- 
KR 
The major objective of the 


scribed could make delay of 


effective. 
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present study, thus, was to replicate 


two of their groups. Also tested were 
two groups of Ss who kept the hand 
at rest between responses, with the 


starting point controlled. 


METHOD 


Subjects.—The Ss were 48 undergraduate 
Men 
and women were assigned in equal number 
to each of four groups. 

A pparatus. 


men and women of Tulane University. 


The apparatus was patterned 
after that of Greenspoon and Foreman: a 
continuous strip of paper rolled over a piece 
of plate glass fastened to a Jaboratory table. 
Che paper slid beneath a carpenter’s square. 
Thus, the glass formed a smooth 
surface beneath the paper and the square 
served as a straight edge for S’s pen. The 


writing 


inner corner of the square was the starting 
point for the pen and, of course, the rulings 
on the square permitted E to measure the 
The apparatus 
was covered with a black cloth except when 
S was blindfolded. 
A Lindy pen was used instead of a me- 
chanical pencil. 


length of line as it was drawn. 


This substitution improved 
the riding qualities of the instrument along 
the square. Furthermore, the properties 
of the writing point were less variable and 
the point could not be broken. Adhesive 
tape was wrapped around the pen from 2 to 
4 in. from the point. The tape made a non- 
slip grip during the hand-maintaining pro- 
cedures lhe useful 
purpose of standardizing the grip between 
Ss and for trials within Ss. 
Procedure A 2X2 _ factorial 
tion of delay of KR and maintenance was 
used. 


tape also served the 


combina- 


KR was administered almost immedi- 
ately after a response (1) or was delayed by 
20 sec. (D). The hand and arm were main- 
tained (Hold) or they were not (Rest). 
The Ss, blindfolded, had 50 trials. 


When 


the first 35 trials were under Treatment I, 


the last 15 were D trials. When 
35 trials were under Treatment 


15 were I trials. 


the first 
D, the last 


The basic sequence of events for all groups 
was a ready signal, a ‘“‘draw”’ signal 5 sec. 
later, and then KR followed in turn by an- 
other ready signal. 
curred every 30 sec. All four groups were 
kept to a 30-sec. intertrial interval; the post- 
KR periods were approximately 30 and 10 sec. 
long for the I and D treatments, respectively. 

KR was “long,” “right,’”’ and “short.” 
The E said “right’’ when the line was within 


The draw signals oc- 
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25 in. of the 3.00-in. goal. However, E 
recorded S’s response error (line length less 
3.00 in.) irrespective of KR. 

The following practices define hand main- 
taining and the Rest and Hold treatments. 

Treatment rest.—U pon completing his line, 
S returned his hand and arm to his lap. The 
S held the pen, but was otherwise relatively 
relaxed. “Hand”’ spoken by E was the signal 
to present hand and pen to E who then guided 
them to the starting point. ‘“Hand”’ 
ceded “Draw” by 5 sec. 

Treatment hold.—U pon completing his line, 
S initiated no further responses. He main- 
tained his grip of the pen, kept the pen point 
where he had stopped it, and kept his arm 
off the table (forearm parallel to the table 
top). ‘Hand’ spoken by E was the signal 
that E was just about to grasp and to guide 
S’s hand to the starting point. 


pre- 


Again, the 
“Hand” signal preceded the “Draw” 
by 5 sec. 


signal 
Treatment Hold was unpleasant, 
tor S had to hold his arm out for about half 
an hour. Nevertheless, all Ss followed 
directions. 

Design compared with Greenspoon and Fore- 
man.—In all essential respects, Groups I-Hold 
and D-Hold are replicates of the groups with 
0- and 20-sec. delays in the study by Green- 
spoon and Foreman. One immaterial differ- 
ence is that our last 15 trials are transfer 
shift trials. This modification was possible 
because the differences between their groups 
were produced during the 
not during the late trials. 

Had it desirable, we 
should, of course, have repeated the condi- 
tion with the longest delay, i.e., 30 sec. It 
would seem nearly impossible to have a 30- 


early trials and 


been possible or 


sec. interresponse interval and delay KR 
by 30 sec. If the administration were pos- 
sible, it would be generally undesirable for 
S would have 0 sec. to respond to his feedback. 
Greenspoon and Foreman reported the 
results in terms of right responses. Right 
responses and the more analytic measure, 
magnitude of error, will be reported here. 


RESULTS 


The data were first examined for 
the mean number of right responses 
in successive blocks of five trials, an 
analysis identical with that of Green- 


spoon and Foreman.’ There were no 


2 Tables of data have been deposited with 
the American Documentation Institute. Or- 


der Document No. 6301, remitting $1.25 
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consistent differences between groups, 
there being several changes in rank 
over blocks of trials. An important 
question is whether the present results 
are of the same order of magnitude 
as those of Greenspoon and Foreman. 
They obtained mean rights of 27.38 
and 15.00 for Groups I and D, 
respectively. Projecting our 35-trial 
results to a base of 50 trials, we obtain 
27.74 and 26.67 for Groups I and D, 
respectively. There is, thus, remark- 
able agreement for I and marked 
disagreement for D. Obviously, our 
D Ss outperformed theirs. Our D 
Ss were about twice as variable, but 
our I Ss were comparable. Moreover, 
within their own experiment, D Ss 
showed the lesser variability. It is 
our guess, then, that the fundamental 
contradiction between studies is at- 
tributable to their D group which 
is low in mean and variability. 

A more analytic measure of the 
line-drawing response is the mean 
absolute error. Means based upon 
successive blocks of five trials are 
plotted in Fig. 1. The left side of the 
figure represents the trials before the 
shift in delay, the right side the 


for 35-mm. micrefilm or $1.25 for 6 by 8 in. 
photocopies. 
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12345678910 
BLOCKS OF FIVE TRIALS 


Fic. 1. The effect of delay of knowledge 
of results and of maintenance. For Blocks 
8, 9, and 10, the I groups were switched to 
Delay and the D groups were switched to 
Immediate KR. 


trials after the shift. The only 
discernable trend is that of learning; 
no differences appear ascribable to 
delay of KR or to maintenance. 

The means and variances of the 
totals for right and errors per S over 
the first 35 trials are given in Table 
1. The column means, based upon 
24 Ss, show that the greater number 


TABLE 1 


MEANS AND VARIANCES FOR THE First 35 TRIALS 
(Right represents the number correct; error (1/16 in.) represents the 
absolute error per S) 


Delay 
Maintenance 


Error 
Hold 
(12.63) 


Rest 1 
(13.72) (648.70) 


8. 
3. 
8. 


Mean 18.92 180.00 


Right 


18.67 
(18. 


42 } 19.50 
72 (11.54) 


19.08 


Error Right Error 


184.33 19.04 184.75 


79) (2596.42) 


162.75 
(1053.11) 


18.96 168.79 


173.54 19.00 176.77 
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TABLE 2 


MEANS AND VARIANCES FOR THE Last 15 TRIALS 
(Right represents the number correct; error (1/16 in.) represents the absolute 


error per S. 


Group I has been switched to Delay and Group D has 


been switched to Immediate KR) 


Former Delay 


Maintenance 


Right 
10.25 
(1.66) 


Hold 


Rest 


Mean 


of rights and the lesser error were 
obtained by Treatment D. Neither 
the Delay nor Maintenance variable, 
however, exerts much influence for all 
of the entries are exceedingly similar. 
This was amply confirmed by analysis 
of variance of the two factorials in 
Table 1. Most of the Fs were less 
than 1.00 and none approached the 
.05 level. 

The right responses and the errors 
for Trials 36-50 are shown in Table 
2. These are the transfer-shift trials; 
Treatment I actually received delayed 
information and Treatment D received 
immediate information. 
all of the entries are very similar. 
In terms of numerical differences, 
both measures of response favor the 
treatment with former immediate KR 
and present delay of KR. 
tests, however, failed by a wide margin 
to reach the .05 level. 

There are eight opportunities to 
compare delay with immediate infor- 
mation in the cells of Tables 1 and 2. 
In five of the eight, the better per- 
formance is associated with the delay 
treatment. These comparisons are 
mentioned in order to make a minor 
point—the difficulty in maintaining 


Once again 


Right 


8.58 64.50 


The F 


Mean 


Error Right Error 


59.62 


9.42 
(546.82) 


9.46 


9.44 


that the better treatment is I. The 
major point is that there is no source 
of significant variance in Tables 1 and 
? 


DISCUSSION 


There was no evidence of special 
efficacy of immediate KR in the present 
study. Greenspoon and Foreman give 
evidence of such efficacy when the hand 
was maintained. They suggested that 
the positive result might be obtained 
when the hand was maintained, but not 
when the hand maintained. 
We have found no evidence that main- 
tenance, nonmaintenance interacts with 
delay of KR, and no evidence that 
delay of KR impairs the learning process. 
Further, the maintenance variable itself 
did not produce significant differences. 

The effects of delay of KR upon the 
learning have been recently 
discussed by Bilodeau and Bilodeau 
(1958). Maintenance is another matter. 
It may or may not interact with delay 
of KR, depending upon what we mean 
by the word. 

Meanings of matintenance.—lf S sus- 
pended activities after drawing a line, 
the traces of the response would be 
undisturbed. A complete suspension 
has never been achieved. It is typical 
procedure for S to initiate and execute 
a number of responses other than line 


was not 


process 
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drawing. Some of these involve the 
hand and arm and their movement to 
the table top, to the starting point, and 
so on. These extra responses are in- 
cluded within nonmaintenance proce- 
dures. As used here, maintenance also 
refers to certain observable responses 
of the hand and arm, e.g., the pencil- 
and arm-holding actions. Maintenance, 
as Greenspoon and Foreman recognized, 
was not the opposite of nonmaintenance. 
Neither the Rest nor Hold techniques 
used here produced or even approximated 
a state of suspended activity. 

Since the Thorndikian line-drawing 
task is not especially suited to achieving 
anything but a moderate degree of 
control over S’s extra responses, we 
prefer the Rest treatment as standard, 
neutral procedure. At least at present, 
S must do something with his arm and 
hand before and after KR, and the hand 
must be positioned for the beginning 
of the drawing response. Bilodeau 


and Bilodeau have suggested that drop- 
ping the arm to one’s side after responding 
and then engaging in normal waking 
activities offer just about as 


stand- 
ardizing (and neutral) a situation as we 
are likely to find for some time. The 
value of normal activity as a standard 
procedure is underscored when pre- and 
post-KR delays are of the order of hours, 
days, and even years. Maintenance at 
any level (limb or trace) would seem 
out of the question with such intervals. 

We allowed that both Rest and Hold 
activities represent potential interfering 
tasks, though not differential ones. 
Our null results do not permit con- 
cluding that level of interference varies 
from Rest to Hold, from I to D, and in 
the interaction of these two variables. 
But the extent to which the outcome 
of the present study is restricted by our 
range of values is most pertinent. As 
far as delay of KR is concerned, the 
range of variation seems adequate, 
Bilodeau and Bilodeau (1958) having 
shown that delays of KR up to week-long 
intervals had no adverse effects upon 
learning. As to the separation between 
Rest and Hold we are not quite as cer- 
tain. First, by Ss introspections, Hold 
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is a lot more tiring than Rest (see the 
section below). Second, our critique of 
the procedures Rest and Hold shows 
that neither is a condition of suspended 
activity. Third, the Rest-Hold differ- 
ences in the results were null. Thus, 
there are two good arguments that Rest 
is not especially different from Hold. 

Someone will no doubt suggest that 
the second argument above shows the 
need for a control group which is com- 
pletely free of interpolated activity. 
This type of control is not yet tech- 
nically feasible as is made clear in the 
debates over the techniques of freezing, 
hypnosis, and drugs in testing retroactive 
inhibition theory. 

Effects of Rest and Hold treatments.— 
There was no over-all difference between 
the two conditions of maintenance. This 
is a puzzle if nonmaintenance (Rest) 
implies a condition of relative rest be- 
tween line-drawing responses, and main- 
tenance (Hold) implies a condition of 
work. Having equated maintenance 
with a species of massed practice, and 
nonmaintenance with spaced practice, 
we might expect performance to favor 
nonmaintenance. It might be that a 
condition of massed activity plus less 
interference has been placed in balance 
against spaced activity plus greater 
interference. Though this is a_possi- 
bility, our preferred interpretation of 
the null result is that (a) the interfer- 
ences are essentially equal, and (bd) 
the differential fatigue exists but is 
irrelevant to the accuracy of line drawing. 
By analogy, the overtired lathe worker 
can produce sufficiently accurate pieces 
of work. Such pieces can be produced 
despite protestations of fatigue. In 
the laboratory, the ancient ergographolo- 
gists often succeeded in producing a 
good deal of fatigue though little finger- 
decrement. Similarly, while Hold Ss 
may be more fatigued than the Rest Ss, 
this does not imply that they cannot 
draw lines as well. It does imply that 
there is some activity which cannot be 
performed as well. This ‘‘some’’ ac- 
tivity need not necessarily be the same 
activity which produced the intervening 
state. Under the circumstances where 
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the state does depress the activity that 
produced the state, the decrement may 
be in one measure of response and not 
in another. 


SUMMARY 


This experiment replicates two groups of a 
line-drawing study by Greenspoon and Fore- 


man, i.e., 0- and 20-sec. delay of knowledge 
of results. Their data gave evidence of the 
efficacy of immediate knowledge of results 
The present experiment gave null results, 
the outcome being entirely consistent with 
a number of earlier studies. 

The Ss of the two groups mentioned above 
followed a “hand maintaining’ procedure 
in that the drawing hand and arm were 
kept off the table between responses, and in 
that E positioned S’s hand for the 
of each response. 


start 
Iwo other groups, also 
0- and 20-sec. delay, did not maintain the 
responding limb, but returned it to the lap 
and rested between responses. Neither pro- 
cedure led to differences between immediate 
and delayed information 
The maintenance procedure 


very tirin Nevertheless 
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between any maintenance and 
groups. This was inter 
preted to mean that fatigue is selective and 
need not impair the accuracy of all types of 
responses 
of trials. 


difference 
nonmaintenance 


even over an extended number 
Possible maintenance procedures ind how 
they might influence the trace of the line- 


drawing response are discussed. 
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EFFECTS OF SITUATIONAL ANXIETY AT DIFFERENT 
STAGES OF PRACTICE! 


JOAN L. BARDACH ? 


New York University 


The hypothesis investigated in the 
present experiment is that situational 
anxiety induced early in practice will 
have a more unfavorable effect upon 
performance than anxiety induced 
late in practice. Insofar as anxiety 
is viewed as a drive that energizes 
all activities that may be in progress, 
the stage in practice at which it is 
introduced should affect performance. 
Early in practice, the strength of the 
correct responses relative to that 
of the incorrect responses is less than 
it is later in practice. Indeed, early 
in practice, especially with difficult 
tasks, the incorrect responses may 
be the stronger, and introduction 
of anxiety therefore may be detri- 
mental to learning and performance. 
Later in practice, however, the strength 
of the correct responses may exceed 
that of the incorrect ones, so that 
the introduction of anxiety should 
facilitate performance, or at least be 
less detrimental than earlier intro- 
duction would be. 

If we consider anxiety as a class 
of responses, we can make the same 
prediction. The responses associated 
with anxiety possess a certain degree 


1 Based on a dissertation submitted in 
partial fulfillment of the requirement for 
the PhD degree at New York University. 
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thanks Ralph Gilbert and Robert Silverman, 
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of strength. If these responses are 
of such a nature that they interfere 
with the correct responses, the ob- 
served effects of this interference 
will depend upon the relative strength 
of these two kinds of reactions. In 
an early stage of practice, the strength 
of the correct responses is less than 
it is in a later stage. Hence the 
amount of impairment of performance 
produced by interfering responses 
associated with anxiety should be 


less in later stages of practice. In 


making this prediction, it is assumed 
that the strength of the anxiety 
reactions does not increase markedly 
during learning. 


METHOD 


Subjects.—The Ss for this experiment were 
volunteer, undergraduate men from New 
York University. Only male Ss were used 
in order to eliminate sex differences in sus- 
ceptibility to electric shock. 

Learning material.—The learning material 
was a list of 10 nonsense syllables, selected 
from Glaze’s list (1928) and of 26.67% 
association value. Nonsense syllables of low 
association value were used because previous 
experimental results indicated that such 
syllables present a learning task sufficiently 
difficult for learning to be impaired either 
by high anxiety, as measured psychomet- 
rically (Taylor, 1953), or by situational 
anxiety induced by unavoidable shock. 

Apparatus.—Each nonsense syllable was 
exposed for 2 sec. on a Patterson memory 
drum, Model 1B. The intertrial interval was 
20 sec. Shock was administered by means of 
a Porter inductorium adjusted for tetanizing 
shock; i.e., the vibrator make-break mecha- 
nism was used. The primary coil and make- 
break vibrator were in series with a telegraph 
key and one No. 6 dry cell. 

The electrodes were pieces of aluminum 
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side on a wooden block, 
against which S’s | 
Leads to the electrodes were taken from the 
secondary terminals of the induction coil. 
Procedure.—The S was asked to anticipate 
each syllable by spelling it. In accordance 
with the procedure of Silverman and Blitz 
(1956), all Ss were given a 


mounted side by 


were strapped 


Ivers 


series of pre- 
liminary training trials in which they were 
required to learn a list of six simple three- 
letter words that were presented in the same 
manner as the experimental task. Following 
the preliminary training, each S learned the 
experimental list of nonsense syllables to the 
criterion of two consecutive errorless trials. 
Four groups of Ss, 17 


used. 


per group, were 
At the prescribed stage of practice, 
S in an experimental group was suddenly 
told: ‘From now on you will receive electric 
shocks at any time While these instructions 
were being given, electrodes were attached to 
S’s fingers. The total time required for the 
ittaching the 
the « 
the trial following this procedure, 


and for 
electrodes was 30 sec. At 


verbal instructions 
mclusion of 
one shor k 
actually wa goiVen. H ywever, no shock was 
given at any later time, although presumably 
S continued to expect one. 

Shock was administered in the experi- 
mental groups either after S had learned 30% 
of the list, i.e., had anticipated three syllables 
correctly (Group E3) or after S had: learned 
70% of the list, i.e., had anticipated seven 
syllables correctly (Group E7 
the control for Group E3, was equated with 
the latter in terms of the number of trials 
to reach the criterion of 30% correct anti ipa 


tions on the experimental list. 


Group C3, 


Group C7, 
the control for Group E7, was equated with 
the latter in terms of the number of trials 
to reach criterion of 70% correct anticipations 
on the experimental list. Since the two 
experimental groups had an interval of 30 
sec. following the learning of 30% or 70% 
of the list, respectively, their control groups 
had an interval of the same length following 
the achievement of the criteria. In 
order to prevent rehearsal of the rote-memory 
task by the control Ss, they were asked simple 
questions, such as age, college major, et 
during this 30-sec. interval. 


same 


After completion of the learning task, each 
experimental S filled out a questionnaire 
designed to reveal his reactions to shock- 
warning and to shock-administration. At 
the conclusion of the experiment, all Ss were 
given an explanation of the experiment and 
attempts were made to allay whatever anxiety 
might have been aroused 
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RESULTS 

Results of the analysis of variance, 
using mean total trials to learn to 
criterion, are shown in Table 1. These 
data indicate no statistically reliable 
differences in performance among the 
groups as a whole, nor between the 
groups that were given shock and 
those that were not, nor between 
those groups that were interrupted 
early in learning and those that were 
interrupted ‘late. Further study of 
the interaction between time of inter- 
ruption and shock—no shock again 
revealed no differences. Comparisons 
of trials or errors to criterion for the 
various groups by ¢ tests (matched 
formula used where appro- 
two-tailed test of  signifi- 
cance) revealed only one significant 
(P < .05) difference between the 
means. Between Groups E7 and C7, 
the ¢ was 2.68 (P = .02, df = 16). 
This finding indicates that it is the 
late introduction of anxiety that 
results in impaired performance, rather 
than its early introduction as was 
the theoretical expectation. This 


groups 


priate ; 


result does not conform to predictions 
based on a view of anxiety either as a 


drive or as a class of interfering 
responses. 

However, qualitative data noted 
during the running of the experiment, 


as well as the results of the question- 
TABLE 1 


\NALYSIS OF VARIANCE OF 
TO CRITERION 


TRIALS 


Source 


n~ 


Between pairs 
B (7 vs. 3) 
Error (b) 

Within pairs 
A (E vs. C) 
AB 
Error (w) 

Total 


os) 


7) 
SIN eS we 


7) 


~ 
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TABLE 2 


NUMBER OF TRIALS TO LEARN 
ro CRITERION 


Groups Compared | N Mean 


E3 Compat. 6 
C3 Compat. | 6 


E3 Incompat. 11 
C3 Incompat. 11 | 


Compat. 
‘7 Compat. 


Incompat. 
Incompat. 


=3 Compat. 
E3 Incompat. 


Compat. 


7 
<7 Incompat. 


£3 Compat. 
E7 Compat. 


E3 Incompat. 
E7 Incompat. 


* P= 05. 


naire and a_brief terminal, interview 
given the experimental Ss, suggested 
that the attitudes induced by the 
introduction of shock were not the 
same for every S. Failure to take 
this fact into account in the statistical 
treatment of the data may explain 
the negative results obtained. 
Therefore, each experimental group 
was divided into two subgroups, 
based upon revealed differences in 
the attitudes induced by shock intro- 
duction. One subgroup was composed 
of Ss who claimed they were not 
disturbed by the shock or who be- 
lieved, despite the instructions, that 
they would be shocked for errors and 
therefore may have been spurred to 
greater effort to learn, or of those who 
felt that by diligent effort they could 
complete the experiment quickly and 
thus avoid further shock, or who did 
not continue to expect shock through- 


out most of the learning. This group 
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is called the Compatible group because 
the attitudes induced by the experi- 
mental conditions seemed to be com- 
patible with the requirements of the 
learning task. The designation E3 
Compat. and E7 Compat. refer to 
the experimental Ss who were given 
shock early or late in the learning, 
respectively, and in whom compatible 
attitudes were induced. Groups C3 
Compat. and C7 Compat. are com- 
prised of those control Ss who 
matched the experimental Ss in per- 
formance before the introduction of 
the experimental conditions. 

The second subgroup into which 
the experimental groups were divided 
consisted of those Ss who were dis- 
turbed by the shock, or who felt 
that shock would be administered 
randomly, or who continued to expect 
shock throughout most of the learn- 
ing. This group is called Incompatible 
because the attitudes induced by the 
experimental conditions seemed to be 
incompatible with the requirements 
of the learning task. The designation 
E3 Incompat. refers to the experi- 
mental Ss who given shock 
early in the learning and in whom 
incompatible attitudes were induced ; 
C3 Incompat. refers to their control 
group. E7 Incompat. and C7 In- 
compat. are the corresponding desig- 
nations for the experimental group 
that was given shock late in the 
learning and its control. 

Table 2 shows comparisons of 
performance when the compatibility 
or incompatibility of responses asso- 
ciated with the secondary anxiety 
drive are taken into account. As the 
table shows, Ss in the compatible 
group who received shock early in 
practice, learned significantly faster 
than their controls, but Ss in the 
compatible group who received shock 
late in practice did not. 

Performance data for 


were 


the incom- 
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patible groups are not simply the 
reverse of those obtained for the 
compatible groups. Table 2 shows 
that the experimental Ss in whom 
incompatible responses were induced 
learned significantly slower than their 
controls, regardless of whether shock 
was introduced early or late in prac- 
The evidence is that Group E3 
Incompat. did not take significantly 
learn to criterion than 
did Group E7 Incompat. 
because of the smal! N of 11 in each 
further necessary 
to establish this point definitely. 

In order to analyze the course of 
the effect, or of 


tice. 


more trials to 
However, 


group, research is 


the immediate and 
the subsequent effects, of shock intro- 
duction, apart from its total over-all 
effect upon N of trials (and of errors) 
the learning, 
3 were drawn. 


various 
Inspec- 
tion of the curves showing the mean 
number of correct responses for each 


to complete 
learning curves 


trial for the subgroups who received 
shock early in the learning discloses 
that, the the 
experimental conditions, the perform- 
ance of the experimental groups in 
relation to their controls is reversed. 
After shock, Group E3 Incompat. 
additional syllables more 
slowly than its control, at least for 
the subsequent 12 trials, but, after 
shock, Group E3 Compat. acquires 
additional faster than its 
control. 
Curves 


after introduction of 


acquires 


syllables 


to show 
the mean number of trials necessary 
to learn 1, 2, 3... 10 syllables, and 
to criterion. 


were also drawn 


Inspection of these 
curves for the subgroups who received 
shock early in the learning discloses 
that, after introduction of the experi- 
mental conditions, the performance 
of the experimental groups in relation 
to their controls is again reversed. 

These curves can be found in the original 
dissertation 


fewer 
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Group E3 Compat., throughout the 
remainder of the learning, acquired 
each syllable subsequent to shock in 
trials than its control, but 
Group E3 Incompat., throughout 
the remainder of the learning, 
quired each syllable subsequent to 
shock in more trials than its control. 

The comparable the 
subgroups who received shock late 
in the learning show that the experi- 
mental had little effect 
on performance of those individuals 
in whom compatible attitudes were 
induced. It may be that there was 


ac- 


curves for 


conditions 


these attitudes 


insufficient time for 
to take effect. 
show that when incompatible attitudes 


However, these data 


are induced even late in the learning, 
the acquisition of subsequent syllables 
is impaired. 


DISCUSSION 


The data for the groups as a whole 
before their division into subgroups in 
accordance with the compatibility or 
incompatibility of the induced responses 
indicate that only late introduction of 
anxiety impaired performance. If these 
findings had been the only ones available, 
the results of the experiment 
have been negative, and moreover, 
would have been difficult to explain in 
terms of any current theoretical position. 

However, when the experimental groups 
are classified according to the type of 
induced associated with the 
introduction of shock, differences in 
performance appear that seem more 
understandable. Now, if the responses 
to anxiety are with the 
learning task, early introduction of 
anxiety is facilitating, although late 
introduction remains ineffective. If the 
responses are incompatible, both early 
and late introduction of anxiety is im 
pairing, although no difference in the 
degree of impairment is demonstrated. 
These results support the rather obvious 
notion that if anxiety is associated with 
incompatible (interfering) i 


would 


response 


compatible 


responses, it 
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will impair learning and performance, 
but they also indicate that if anxiety 
is associated with compatible responses, 
it may significantly facilitate learning 
and performance, even when task dif- 
ficulty is held constant. 

However, one problem in the inter- 
pretation of the results remains, namely, 
that when anxiety is introduced late 
rather than early in practice, even those 
Ss in whom compatible responses were 
evoked were inferior to their controls. 
A possible explanation of this finding 
is as follows. In a situation in which 
compatible responses are induced, a 
time-lag probably occurs because time 
is required for the compatible responses 
associated with the anxiety to produce 
their potential effects on performance. 
Table 2 shows that the difference in per- 
formance between Groups E7 Compat. 


and its control is not significant. It may 


be that the introduction of shock in this 
experimental group occurred too late in 
learning for the effects of compatible at- 
titudes to be reflected in performance. 
The present results suggest that anxiety 
can best be conceptualized as having 


both drive properties and associated 
responses, a view that is similar to that 
of Mandler and Sarason (1952). If 
accurate predictions of performance are 
to be made, both of these aspects must 
be taken into account. The data suggest, 
in addition, that responses associated 
with the introduction of shock are 
different for different Ss, so that accurate 
predictions require that these individual 
differences also be considered. However, 
since the conclusions from the present 
study are based on post hoc considera- 
tions, they must be regarded as tentative 
until cross-validating studies are made. 


JOAN L. BARDACH 


SUMMARY 


\n experiment tested the hypothesis that 
situational anxiety induced early in practice 
will have a more unfavorable effect upon 
performance than anxiety induced late in 
practice. Four groups of Ss learned a difficult 
list of nonsense syllables. One group was 
given unavoidable electric shock after they 
had learned 30% of the list; another group 
was given unavoidable shock after they had 
learned 70% of the list. Each experimental 
group was matched on preshock performance 
with a control group. 

The data showed that it was late rather 
than early introduction of situational anxiety 
that impaired performance, indicating a need 
for modification of the original hypothesis. 
When each experimental group was divided 
into two subgroups in accordance with Ss’ 
reported attitudes toward shock, it was found 
that the different responses induced by the 
shock had significant effects on performance. 
rhe results suggest that when shock induces 
responses that are compatible with the 
learning task, facilitation of performance may 
occur, but that when shock induces responses 
that are incompatible with the learning task, 
performance is impaired regardless of when 
in the course of learning shock is introduced. 
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PERCEPTION OF VERTICALITY !? 
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In a previous study (Pearson & 
Hauty, 1959) with the 
function of learning in the propriocep- 
tive perception of verticality, it was 
found that alternation of the direc- 
tion of tilt on successive trials effected 
a progressive reduction in the error 
of perceived verticality. In account- 
ing for this effect, two possibilities 
were discussed: the learning of addi- 
tional and unique proprioceptive cues 
to be gained from alternation and 
the nullification of effects of sensory 
adaptation which also could have 
been provided by this experience. 

The present study was designed 
for the purpose of determining the 
effects of adaptation imbalance and of 
increase in the delay at tilt upon the 
error of perceived verticality. Should 
these effects attenuate reduction in 
error, sensory adaptation may be 
regarded as the predominant factor. 
If not, then support for propriocep- 
tive learning will have been realized. 


concerned 


METHOD 


A pparatus.—The tilting-chair used in this 
study has been described in detail elsewhere 
(Pearson, 1957). The seat was adjustable 
to the sitting height of S so that the medial 


1 The contents of this manuscript reflect 
the personal views of the authors and are not 
to be construed as a statement of official 
Air Force policy. 

2? Now at the Civil Aeromedical Research 
Institute, FAA, Oklahoma City. 


axis at eye level would be approximately at 
the point of chair rotation. The S’s control 
panel contained adjustable, leather sleeve 
cuffs to support his forearms at positions of 
tilt, and a three-position switch providing 
him with directional control of the chair 
which, of course, was also under the inde- 
pendent control of E. The test room itself 
was lightproof and airconditioned. Both red 
and white light were available to EZ at his 
discretion. 

Design.—The experimental variables were 
of the following nature: (a) Alternation, or 
directional sequence of tilt, within each of 
the two magnitudes of tilt: A;—no alterna- 
tion, in which all trials involved the tilting of 
S to his left; A:—alternation, in which S 
was tilted to the left, either 10° or 30°, on all 
odd-numbered trials and 10° to the right 
on all even numbered trials. (6) Tilt magni- 
tude: T,;—10° left; T.—30° left. (c) Delay 
at tilt: D;—no delay; D.—30 seconds delay; 
D;—60 seconds delay. 

The Ss were assigned in equal numbers 
and randomly to each cell of the 2 KX 2 K 3 
factorial design. The experimental sample 
was obtained from a population of volunteer, 
experimentally naive, basic Air Force trainees. 
Four Ss were tested per day until a sample 
of 96 was attained. 

Procedure.—The S was secured in the 
tilting-chair, then was indoctrinated in his 
task. He was told that the test session would 
consist of 20 trials. If he had no questions, 
testing then commenced. The E moved S 
to a position of tilt, then, following the delay 
period (if any), transferred control of the 
chair to S who made his judgment. When 
satisfied with his position, S responded “OK” 
or “Allright.” The E then recorded the direc- 
tion and amount of error. Thereafter, E 
proceeded in accordance with the “alterna- 
tion’”’ condition to which S was assigned. 
Delay at the perceived vertical was 20 sec. 
in all cases. 


425 





RICHARD G. 


TABLE 1 


ANALYSIS OF VARIANCE OF 
ABSOLUTE ERRORS 
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RESULTS 
In accordance with the main inter- 
ests of this the 


analysis was confined to the absolute 


paper statistical 


magnitude of error in perception of 


DEGREES ERROR 


MEASURES 


ic. 1. The effects of alternation upon 
perception of the postural vertical. 
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ALTERNATION 





‘}-. . ty, oamiorR 7 
Troy’ yal ennred 





i 3 «15 


MEASURES 


iG. 2. 


perception of the postural vertical. 


rhe effects of magnitude of tilt upon 


the vertical for 
trials only. 


the odd-numbered 
Results of a repeated- 
measurements analysis of these data 
are shown in Table 1. Of particular 
concern to the authors is the finding 
of significant effects attributable to 
both the Alternation and Tilt 
ables. 


vari- 
1 the alterna- 
tion subgroup (10° R), as compared 
with the ‘“‘control’”’ subgroup, demon- 
strates a marked reduction in error 
beginning at Trial 3, and maintains 
this superior performance throughout 
testing. Figure 2 plots the effects 


As seen in Fig. 


of magnitude of tilt upon perception 
of the vertical, and, as one would 
expect, the 30°-tilt subgroup has a 
much larger error of adjustment than 


the 10°-tilt subgroup. It should be 
noted at this point that, although 
both Alternation and Tilt main effects 
are significant, neither the simple 
interaction, A X T, nor the second- 
order interaction, M XK A XT, ap- 
proach significance. The importance 
of these findings are discussed below. 

The factor of Delay has no signifi- 
Table 1, 


the Delay F ratio is not significant, 


cant effect. As shown in 
and, furthermore, none of the inter- 


actions involving the Delay variable 





PROPRIOCEPTIVE PERCEPTION OF VERTICALITY 


with the other main variables are 


The absolute mean errors 
of adjustment for the 0-, 30-, and 
60-sec. delays were 4.64°, 5.05°, and 
4.59° respectively. 


significant. 


The two significant Fs, Measures 
M XT, are doubtless attribu- 
table to the progressive reduction in 
the 30°-tilt 


and 


error achieved by sub- 


group (see Fig. 2 


DISCUSSION 


The primary concern of the present 
study was with the effect of alternation 
upon perception of the postural vertical. 
On odd-numbered trials all Ss 
tilted to the left; on even-numbered 
trials half of these Ss were tilted a small 
imount past the true vertical to the 
right while the other half were tilted 
again to the left. It was then demon- 
strated that perception of the vertical 
by the alternation subgroup was signifi- 
cantly superior to that of the control 
subgroup. Further, since the interac- 
tions, AX T and MXAXT were 
found not to be significant, it seems 
reasonable to assume that adaptation 
imbalance did not attenuate progressive 
reduction in error of ver- 
ticality. Similarly, neither the 
F ratios for the Delay variable nor those 
involving interaction of Delay with the 
other main variables proved to be 
significant, one must conclude that 
increased delay at tilt also did not pro- 
duce such attenuation. Such findings 
lead the authors to conclude that 
proprioceptive learning, rather than sen- 
sory adaptation, is the determining factor 
in progressive reduction in the error of 
perceived verticality. 

The confirmation of our previous 
finding (Pearson & Hauty, 1959) that 
the 
variable in 


were 


perceived 
since 


“passing through vertical” is an 


important perception of 
postural verticality may have important 
implications regarding the results of 
work by Mann, Berthelot-Berry, and 


Dauterive (1949) and Solley (1956). 
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They employed the practice of moving 
S to the opposite quadrant before moving 
him to the trial position as a precaution 
against the possibility that S would 
count himself back to the true vertical. 
It is now clear that this procedure 
plays a definite and positive role in 
reducing errors of perception of ver- 
ticality. Consequently, it is question- 
able whether the ‘‘true’’ effects of such 
variables as delay at tilt and rate of 
chair movement have 
studies wherein alternation 
employed as a precautionary measure. 
In agreement with results reported 
by Clegg and Dunfield (1954) is the 
finding that delay did not have a signifi 
cant effect upon perception of verticality. 
This confirmation supports a previous 
1959).: 
namely, that averaging the results of 


been isolated in 


has been 


conclusion (Pearson & Hauty, 


several consecutive trials does not pro- 
vide a clear picture of the state of per- 
formance in these studies. Performance 
curves may be expected to reveal that 
with repeated testing certain variables 
do not have a significant effect upon 


performance. 


SUMMARY 


Ninety-six Ss were tested on a lateral-tilt 
chair under darkroom conditions. The task 
consisted of 20 trials and required S to return 
himself from a position of tilt to the perceived 
vertical. It found that neither the 
procedure of effecting an adaptation im- 
balance by tilting S 30° L, then 10° R, on 
alternate trials, nor that of increasing the 
delay at tilt to 30 or 60 sec 
gressive reduction in error of perceived ver- 
ticality. This would suggest proprioceptive 
learning, rather than sensory adaptation, to be 
the factor of principal importance 


was 


attenuated pro- 
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SUPPLEMENTARY REPORT: THE YERKES-DODSON LAW AND SHIFT 
IN TASK DIFFICULTY! 
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Purdue University 


Broadhurst (1957) has reported a confirma- 
tion of the Yerkes-Dodson Law by use of an 
underwater Y-maze in which motivation was 
manipulated by the amount of oxygen 
deprivation prior to the release of S, and 
task difficulty was varied by changing the 
relative difference in brightness between the 
two arms of the Y. He also reported that 
emotional Ss exhibited a higher drive level 
than nonemotional Ss. We used Broad- 
hurst’s technique to study the effects of 
infantile handling since it has been suggested 
that such handling modifies S’s emotionality. 
We report here a pilot study using a straight- 
way as well as some of the Y-maze data of our 
control Ss from the infantile experience study. 
These groups may be considered to provide 
a partial replication of Broadhurst’s study 
with different aged Ss and with somewhat 
different delay intervals. 

Vethod 
was used. 
bulbs 
incorrect alley served as the barrier. 
correct alley had no barrier. 


\ replica of Broadhurst’s maze 
It was illuminated by two 100-w. 
A black wooden block at the end of the 
The 
The difference 
in alley illumination was determined by a 
Western illumination meter, No. 756, 
filters removed The difference 
ratio of 1.50. 

In a pilot study 5 trials per day were given 
21 Harvard Wistar rats (55-61 days old) ina 
4-ft. underwater straightway. 
of 0, 3, 6, 9, 12, 15, and 18 sex 
a 7X7 latin square which 
days and test sequences. 
was recorded in .01 sec. 


with 
was in the 


Time delays 
were used in 
balanced test 
Swimming time 
In the Y-maze study, 
time intervals of 0, 3, 6, and 12 sec. were 
used, with 14 Ss randomly assigned to each 
condition. Starting at 5i days Ss were given 
5 trials a day for 10 successive days. The 
from the moment that the 
startbox was opened until S’s head broke 
water was recorded, as was the number of 
errorless trials per day. 

Results.—The pilot study yielded signifi- 
cant differences (P < .01) among Days and 
Delay Conditions in an analysis of variance of 


time in .O1 sec. 


1 This study was in part supported by a grant from 
the Purdue Research Foundation and in part by Re- 
search Grant M-1753 from the National Institute of 
Mental Health of the National Institutes of Health, 
United States Public Health Service. 

2Presently at Department of 


Iowa 
State University. 


Psychology, 


swimming times. The plot of time against 
days is similar in form to Broadhurst’s Fig. 1. 
The relationship between swimming time and 
delay time is highly irregular, but increased 
time tends to be associated with longer delay 
intervals. 

The Y-maze Ss averaged 86.6 errorless 
trials out of 100. This is similar to Broad- 
hurst’s “‘easy”’ discrimination task where he 
found a mean of 86.4 errorless trials out of 100. 
Analyses of the log time scores and number 
of errorless trials failed to yield significant 
differences among the motivational levels. 
However, a significant Days X Motivation 
interaction (P < .01) was found for the time 
data. The errorless trial data showed that 
Ss had virtually learned the discrimination 
by the end of Day 5, making 97.3% correct 
responses during Days 6-10. Thus behavior 
during Days 1-5 reflects acquisition of the 
response; behavior during Days 6-10 reflects 
performance of the acquired response. The 
swimming times were separately analysed 


y, 








rn 4 
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DELAY 


_ Swimming time as a function of delay 
interval for Days 1-5 and 6-10. 
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for each 5-day period as a function of delay 
interval. Analyses of these curves (Fig. 1) 
yielded a significant (P < .01) quadratic 
(curvilinear) effect between swimming time 
and delay time during Days 1-5 and a sig- 
nificant (P <.01) linear function during 
Days 6-10. 

Discussion.—Broadhurst, in his straight- 
way study, tested Ss first at the 0-sec. delay, 
then at the 5-sec. delay, etc., thus confound- 
ing practice effects and delay intervals.* Our 
data, which separately evaluated these two 
variables, are not consistent with his, but 
neither are they consistent with our Y-maze 
It appears that the use of a design 
in which Ss are tested under different delay 
times is not valid because of differential 
transfer effects. 

Our light-dark ratio was approximately 
1:50 while Broadhurst’s was about 1:300 for 
his “easy”’ task, but the two studies report 
almost the same percentages of correct 
responses. However, Broadhurst’s “‘moder- 
ate”’ task had a light-dark ratio of 1:60 


results. 


‘ 


3P, L. Broadhurst. Personal 


September i958. 
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(83.1% correct responses); and our data 
should have come closer to approximating the 
findings of his “moderate’’ rather than his 
“easy” task. The failure to do so can prob- 
ably be accounted for by the ages of Ss in- 
volved (213 or 322 days vs. 51 days) and 
strain differences. 

Even though the main effect of Motivation 
was not significant, our Y-maze results, like 
Broadhurst's, confirm the Yerkes 
Law when task difficulty is changed by 
repeated experience. The task changed 
markedly in difficulty between Trials 1-5 
and 6-10. According to the Yerkes-Dodson 
Law, optimal motivation should be less during 
the more difficult learning trials (Trials 1—5) 
than during the latter trials. This effect 
was obtained, and suggests that change in 
task difficulty as a function of repeated prac- 
tice must be taken into account when testing 
the Yerkes-Dodson Law. 


Dodson 
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SUPPLEMENTARY REPORT: FURTHER FACTORS AFFECTING THE 
PROBABILITY OF CHANGING RESPONSES ! 


MAYNARD W. SHELLY II 
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In a previous study (Shelly, 1958) the 
effects of response contingent probabilities 


upon response change was studied. Two 
factors were identified: changes are condi- 
tional upon (a) repetition not being rein- 
forced, and (b) the probability that a change 
results in a reinforcement. This supple- 
mentary report extends these results. 
Method.—All procedures, including num- 
bers of Ss per condition, were identical to 
those used in Shelly (1958). In the preceding 
study the conditions were presented by 
matrices, each row of which differred from 
the one above it through moving the elements 
one place to the right. Thus, to specify a 
condition it becomes necessary only to give 
! This research was completed at the University of 
Wisconsin while the author was a Social Science Re- 
search Council Research Training Fellow. Some of 
these results are included in the author’s PhD disserta- 
tion. The author wishes to thank the Office of Naval 
Research for providing him with time, and the Depart- 


ment of Psychology, Northwestern University for the 
facilities to complete this research. 


the first row with the understanding that the 
first position specifies the probability that 
repetition will be reinforced. Using this 
method of presentation, five nonexclusive 
sets of conditions can be described: Set A: 
{(.00, 1.0, .00, .00), (.00, 1.0, 1.0, .00), (.00, 
1.0, 1.0, 1.0)} ; Set B: {(.00, .33, .33, .33), (.00, 
.50, .50, .00), (.00, .50, .50)} ; Set C: {(.50, .00, 
.00, .00), (.50, .25, .00, .00), (.50, .50, .00, .00), 
(.25, .25, .25, .25), (.50, .50, .50, .50)}; Set D: 
'(.50, .50, .00, .00), (.50, .25, .25, .00), (.50, 
.16, .16, .16)}; and Set E: {S = (.75, .25, .00, 
.00), T = (.25, .75, .00, .00), Z = (.00, .75, 
.25, .00)}. For Set E, the labeling of condi- 
tions is done upon the basis of the first letter 
of the percentage of reinforcement associated 
with repetition, e.g., S for 75%. To refer to 
any other condition, e.g., the first condition 
of Set A, the notation A, will be used. 
Results and conclusions.—Within the sets 
of conditions, methods of analysis identical 
to those used in the previous paper, including 
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2.8 
(b) 


BLOCKS OF 100 TRIALS 


Fic. 1 


a constant confidence level of 05, will be 
used “Po” stands for the probability of 
changing responses on successive trials. To 
indicate that, for example, Pc for A; and Az 
was greater than Pc for A» on Blocks 3 and 
$ of 100 trials, the short hand 
(Pe( Ay, Az) > Pe( Az)! 3, 4) will be used 

The following are the conclusions together 
with the supporting results. 


notation 


1. Increasing the reinforcement of repeti- 


tion, even though changing responses leads 
to equal reinforcement, reduced Pe: (Pce(C, 
> Pe(C;)}1, 2, 3, 4). 

2. A larger reinforcement for a specifi 
change appears to be superior in increasing Pc 
than the same probability spread over a num- 
ber of alternatives: (Pco(D,) > Pe(D;)|2). 

3. The differences in (2) were small, how- 
ever, and when the number of alternatives 
to which a change can be made is increased 
without any decrease in the probability that a 
change can lead to a reinforcement, then Pc 
may increase: (Pe(A3) > Pe(A,)|1). 

4. In agreement with Conclusion 2, when 
the probability of a single change leading to 
a win is increased without altering the proba- 
bility of reinforcing repetition, Pc, if it 
changes significantly, increases: (Pc(C;) 
> Po(Ca, C;)|2); (Pe(Cs;) > Po(C;) |4). 


5. There was some evidence that number 


Probability of responding to an alternative when repetition is reinforced with 


varying probabilities. 


of available alternatives had an initial effect 
on Pe. Although the only significant relation 
was (Pc(B:) > Pc(B;)!1), Pc( Bz) lay regu- 
larly between the other two corresponding 
probabilities. 

6. That the choice of an alternative on 
trials can be influenced by the 
value of the conditional probability of rein- 
forcing repetition can be seen in Fig. 1. In 
this figure A;; refers to that alternative which 
on a given trial has the probability of rein- 
forcement of .75 associated with it; corre- 
sponding interpretations apply to Aes and Ap. 
Where two values do not differ significantly 
for the same block of 100 trials, they are 
combined into a single point. If the proba- 
bility of reinforcement were independent of 
changing or repeating responses, all three 
graphs would exhibit but a single curve. 

The preceding results extend the findings 
of Shelly (1958) and would seem to indicate 
that factors influencing changing responses 
need to be considered in predicting rate of 
learning. 


suc cessive 
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SUPPLEMENTARY REPORT: FAMILIARITY AND FREQUENCY! 
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The writer has proposed (Noble, 1954) a 
hyperbolic law connecting judged familiarity 
(f) and frequency of occurrence (m) for un- 
familiar and meaningless verbal stimuli. The 
empirical data, reproduced in the upper half 
of Fig. 1, were fitted by the equation, 


n 


= [1] 


an +b 


where f denotes successive-intervals trans- 
formations in z units of familiarity ratings 
(Noble, 1953), » is the physical number of 
exposures, and a and b are empirical con- 
stants. An index of correlation (p), calcu- 
lated from a linear plot of reciprocals (1/fa1 /n) 
by the product-moment method, gave a value 
based on 288 Ss of .998 (N = 9). Equation 
1 thus accounts for 99.6% (p? K 100) of the 
variance in the dependent variable. 

Repeating this experiment with random 
visual shapes on 600 Ss from the same 
population, but using a graphical scaling 
method, Arnoult (1956) obtained a similar 
curve. To describe it, however, he 
the logarithmic function, 


che se 


f=a log (n + b) —c [2] 
Arnoult reported a higher index of correla- 
tion for a semilog reduction of Equation 2 
(p = .997), in which 6 = 1, than for the 
linear plot resulting from taking reciprocals 
of Equation 1 (p = .950).2,. Comparing p 
values from reductions of Equations 1 and 2, 
however, is dubious practice because mathe- 
matically different transformations are in- 
volved. One should not forget that p? then 
describes the precision of the general law 
modified by scale characteristics. 

Goodness of fit depends, furthermore, 
upon the particular scaling operations fol- 
lowed in calculating familiarity scores and 
upon whether any ad hoc constants are added 
to the independent variable n. Limiting our 


1 This work was supported in part by a contract 
between the office of Naval Research and Montana 
State University. R.W. Chambers and Q. A. Leland 
checked the statistical analyses. 

Reproduction in whole or in part is permitted for any 
purpose of the United States Government. 

2 Arnoult kindly made available some of his data 
for reanalysis in our laboratory. His index of .950 
(Arnoult, p. 274) for a linearized Equation 1 is er- 
roneous; the value of p applied to data derived by the 
Garner-Hake method is actually .974 (N = 10). 


attention to the most comparable aspects 
of these experiments, we find that if the same 
(Thurstone-Edwards) methodology used by 
Noble (1953, 1954) is applied to Arnoult’s 
original frequency distributions, the rectangu- 
lar hyperbola shown in the lower half of Fig. 
1 results. Its p value derived from a recti- 
fication of Equation 1 is .997 (N = 10), 
accounting for 99.4% of the variance. If we 
apply Equation 2 in semilog form to the 
same data, the p value is .9917 (N = 10), 
explaining only 98.3% of the variance in 
familiarity (f). A still smaller index of 
determination might result if Arnoult’s 5-hr. 
delay group were eliminated their 
ratings tended to be higher. 

When procedures for measuring the de- 
pendent variable are thus equated, Arnoult’s 
data confirm Noble’s finding that familiarity 
(f) is determined almost uniquely by fre- 
quency of exposure (m), and that this de- 
pendency grows according to a hyperbolic 
law. The precision and generality of Equa- 
tion 1 is striking; only the slope (b) and 
intercept (a) parameters of the reduction 
line, 1/f = b(1/m) +a, change when the 
task and stimulus characteristics are experi- 
mentally altered. Errors of measurement are 
reduced to the order of 1°% in both cases. 

Because of the “extraordinary accuracy” 
of the empirical fits of Equations 1 and 2, 
a casual reader may be tempted to conclude 
that these two sets of observations are 
“genuinely different’ (Arnoult, 1956, p. 275). 
After descriptively asserting that ‘‘f is a 
monotonic, negatively accelerated function 
of n’’ (p. 272), Arnoult proceeded to embrace 
the notion of an infinitely-rising curve (p. 
275) which is conventional among perception 
researchers. Hyperbolas have asymptotes, 
but logarithmic curves are unbounded. On 
the hyperbolic hypothesis, the calculated 
asymptote (1/b) for Noble’s data has a 
maximum f value of 1/.130.= 7.69 z units, 
while that for Arnoult’s data has an f value 
of 1/.358 = 2.79 z units. A relatively simple 
experimental resolution of the issue would be 
possible by extending the range of n values 
from 0 out to frequencies of 50 or 100. If 
Equation 1 is the correct form of the famili- 
arity-frequency law, very little further rise 
should occur in the two functions as nm in- 


since 
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creases without bound. Since the f 
lie on an interval scale, the 0-25 segment 
shown in Fig. 1 would have to be re-explored 
and the same Ss should be employed at all 
points on the curve. 


scores 


Until the range of observations is increased 
well beyond nm = 25, one may continue to 
entertain an hypothesis (Equation 1) which 
predicts behavior with less than 1% error, 
which specifies a rational origin and finite 
asymptotes, which is more parsimonious than 
Arnoult’s special logarithmic function (Equa- 
tion 2), and which applies equally well to two 
classes of stimuli. Referring again to Fig. 1, 
the writer reaffirms his contention (Noble, 
1953) that familiarity is a learnable attribute 
of stimuli, and that Fig. 1 contains acquisition 
curves rather than psychophysical functions. 
This associational hypothesis derives not 
from hyperbolic-logarithmic distinctions but 
from a consideration of the fundamental 
operations involved in the process of familiar- 
ization (Noble, 1954, p. 15). The S-R law 
represented by Equation 1 appears to hold 


The familiarity-frequency relationship for two different classes of 


stimuli. 

as well for the stimulus class of random 
shapes (Arnoult, 1956) as for that of mean- 
ingless words (Noble, 1954). This is not 
true of Equation 2, whose goodness-of-fit 
index is: only 94.65% (N =9). In every 
instance’ the hyperbolic law predicts the data 
better than the logarithmic law when famili- 
arity is defined by the original scaling opera- 
tions ( Noble, 1953). 

Finally, experimental attempts to delimit 
the law’s invariance by systematically ex- 
ploring other sensory modalities should be 
undertaken in conjunction with theoretical 
studies designed to clarify the basic learning 
process by means of which frequency exerts 
its effect on familiarity. 
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Buss and Buss (1956, Exp. 1) found that 
using the verbal reinforcement combinations 
Right-Wrong (RW) and Nothing-Wrong 
(NW) led to faster learning than did Right- 
Nothing (RN), and that the former two 
combinations were approximately equally 
effective. The present experiment is an 
extension of the previous study using a new 
population (normal children) and a new 
response measure (errors in a fixed series 
of trials). 

Method.—The same stimulus materials 
and procedure were used as in the previous 
study with the following exceptions. Each 
S was given 150 cards to sort with correct 
sorting for the first 50 cards on the basis of 
color and for the next 100 cards on the basis 
of number. Ninety grade school pupils were 
assigned to three groups of boys and three 
groups of girls. Each group was matched for 
1Q level on the basis of the Otis Quick Scoring 
Test of Mental Ability, and each group 
contained identical proportions of Grades 
5 and 6. One group of each sex was assigned 
to each of the three verbal reinforcement 
combinations. 

Results.—The means and SDs of the total 
number of errors made by the three treatment 
groups during each sorting are shown in 
Table 1. These data were subjected to 
analyses of variance (Treatments X IQ level 
< Sex) with each basis for sorting being 
considered separately. The effects of Treat- 


TABLE 1 


MEANS AND SDs oF ERRORS FOR VERBAL 
REINFORCEMENT COMBINATION GROUPS 


Color-sorting Number-sorting 


Group 


" 

Mean | SD 
32.97 
43.93 
32.68 


28.83 
44.90 


27.97 


14.83 
26.87 
14.16 


1 This paper is based on an M.A. thesis submitted to 


the graduate school of Indiana University. The author 
wishes to express her appreciation to A. M. Buchwald, 
thesis chairman, for his many helpful suggestions. 
The major results of this study were presented at the 
1959 meeting of the APA. 


ments were significant at the .01 level for 
color-sorting and at the .05 level for number- 
sorting. A sex difference was also significant 
at the .05 level for color-sorting, the boys 
making fewer errors than the girls. No 
other differences were significant. For both 
color- and number-sorting, ¢ tests revealed 
that the RN group was significantly different 
from the NW and RW groups. The latter 
two groups did not differ significantly in 
either instance. 

Discussion.—The results agree with those 
obtained by Buss and Buss (1956) for adults 
on a similar task. However, the results are 
at variance with those reported by Terrell 
and Kennedy (1957). These investigators 
studied the effects of a number of types of 
reinforcement on learning in children. Among 
these were E’s verbalizations to indicate only 
correct responses for one group and only 
incorrect responses for a second group. AIl- 
though these two groups are somewhat 
comparable to the RN and NW groups of the 
present experiment, there were no significant 
differences in the effectiveness of these two 
combinations. However, the Terrell and 
Kennedy procedure also included a light 
flash which indicated correct responses to all 
Ss. In the present experiment, and in that 
reported by Buss and Buss, the S’s only 
source of information about the correctness 
of a response was the assigned verbal rein- 
forcement combination. The difference be- 
tween these results and those obtained by 
Terrell and Kennedy may be a result of the 
presence of additional information provided 
to Ss about their responses in the latter 
experiment. It seems possible that the 
differential effectiveuess of the verbal rein- 
forcement combinations may be dependent 
on there being no other information about 
the conwectness of a response available. 
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